8: 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



BEST AVAILABLE COPY 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
24 October 2002 (24.10.2002) 




PCT 



(10) International Publication Number 

WO 02/084250 A2 



(51) International Patent Classification 7 : G01N 

(21) International Application Number: PCT/US02/1 201,4 

(22) International Filing Date: 17 April 2002 (17.04.2002) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 
60/284,273 
60/284,713 



17 April 2001 (17.04.2001) US 

18 April 2001 (18.04.2001) US 



(71) Applicant (for all designated States except US): FEM- 
TOLINK BIOTECHNOLOGIES LLC [US/US]; 11930 
Heritage Oak Place, Suite 5, Aubum, CA 95603 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): EVERETT, 



Nicholas, P. [GB/US]; 604 Apple Tree Lane, Meadow 
Vista, CA 95722 (US). PETELL, James, K. [US/US]; 
9203 W. .Gage Boulevard, Kennewick, WA 89336 (US).' 
YOUNG, Scott, A. [US/US]; 4400 Partridge Lane, Mid- 
land, MI 48640 (US). 

(74) Agent: FOLEY, Shawn, R; Lerner, David, Littenberg, . 
Krumholz & Mentlik, LLP, 600 South Avenue West, West- 
field, NJ 07090 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FT, GB, GD, GE, GU, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SD, SE, SG, 
SI, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, 
VN, YU, ZA, ZM, ZW. 

[Continued on next page] 



(54) Title: METHODS FOR MASS SPECTROMETRY DETECTION AND QUANTIFICATION OF SPECIFIC TARGET PRO- 
TEINS IN COMPLEX BIOLOGICAL SAMPLES 



Predicted Differential Protease Cleavage Peptides of EPSPS 



< 
O 

in 

00 

© 



o 



Peptide Trypsin 


Lys-C 


fcrg-C 


Asp-N 


Clu-C 


Number* 


(KR) 


(K) 


(R> 


ID) ( 


E) 


1 


1328. S 


1456.6 


1328.5 


2S6B.9 


4477.0 


2 


146.2 


1373.5 


991.1 


2996.3 


902.1 


3 


062.9 


3460.9 


1105.2 


2146.5 


204.2 


4 


528.6 


1232. S 


13S9.S 


646 .7 


2089.5 


5 


S98.7 


5620.3 


2375.7 


1379. 5 


1784.0 


6 


1359. S 


2080.5 


287.4 


194S.2 


5144.9 


7 


1558 .7 


1639. a 


3315.7 


280.3 


B77.0 


8 


ess.o 


1860.2 


3048.5 


63 B. 7 


609S.1 


9 


287.4 


2978. S 


710.9 


2912.4 


1407.6 


10 


146.2 


3005.4 


1449.6 


190.2 


1345.5 


11 


3187. S 


8224 .5 


697.9 


5443 .5 


S557.S 


^12 


2450.8 


4332.0 


1230.4 


1991.2 


634.7 


13 


615. B 


1359.5 


3279.9 


204 .2 


1025.4 


14 


710. 9 


2316.5 


26 04 .8 


2068.4 


2042.3 


15 


789.9 


5950.9 


388.5 


386.4 


1033.1 


16 


617.6 


B04 .9 


473.5 


17*2.0 


1235.4 


11 


687.9 


160.2 


4375.1 


2714 .2 


891.0 


ie 


300.4 




2183.5 


1382.5 


247.1 


19 


94 B. 1 




1115.2 


303.3 


643.8 


20 


930.1 




273.3 


1757.0 


2076.3 


21 


2367.8 




1388 .S 


788.9 


204 .2 


22 


628.6 




3250.7 


3194 .6 


4292.0 


23 


1994 .2 




732 .8 


1S26 .6 


1808 .0 


24 


388 .S 




2559 .9 


236.2 


1395.7 


2S 


473.5 




22S7.4 


1414 .5 


704 .8 


26 


203 .2 




5251.1 


1538 .7 




27 


4189.9 






2315.7 




28 


2183 . 5 






133.1 




29 


3115.7 






1460.7 




30 


273.3 






1317.6 




31 


534 .6 






504 . S 




32 


B71.9 










33 


32S0 .7 










34 


245.3 










35 


505. S 










36 


872.0 










37 


170S.9 










38 


628.7 










39 


1646.8 








(amino acid 


cleaved) 










40 


4322.1 








single amino 
acid 


43 


804 .9 






preferred si2e 


42 


160.2 






peptides 






• from H- terminus 



(57) Abstract: Mass spectrometry-based methods are de- 
scribed for the detection or quantification of targeted pro- 
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ucts derived from plants, animals or microorganisms. 
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METHODS FOR MASS SPECTROMETRY DETECTION AND QUANTIFICATION 
OF SPECIFIC TARGET PROTEINS IN COMPLEX BIOLOGICAL SAMPLES 

TECHNICAL FIELD 

The present invention relates generally to the use of mass spectrometry to detect, 
5 quantify, and characterize target proteins in complex biological samples. 
BACKGROUND 

The development of sensitive, precise, high throughput methods for directly 
monitoring specific proteins is increasingly desirable for diagnostic applications in both 
agricultural and clinical fields. Recent advances in the use of mass spectrometry (MS) in 
10 conjunction with protein/DNA-sequence database search-algorithms allow for the 
identification of proteins with unprecedented speed (Mann and Wilm 1994, Yates et al 1995). 
Despite these advances, it remains difficult to obtain accurate quantitative information 
concerning the levels of the identified proteins and the levels of site-specific or other 
modifications to individual protein molecules. 

15 The principal state-of-the-art approaches for quantitative analysis of individual 

proteins are enzyme-linked immunosorbent assays (ELISAs) and sandwich ELISAs 
(sELISAs). The crux of these methods is the binding of an antibody molecule that recognizes 
the protein or peptide of interest. The development of an ELISA for a target protein is a 
particularly laborious and lengthy task, which requires production of monospecific antibodies 

20 to one or more epitopes residing within the protein. An epitope is a single limited amino acid 
sequence that is recognized by the antibody. ELISA systems based on a monoclonal antibody 
recognize a single epitope on the target protein ELISA systems based on polyclonal 
antibodies tend to recognize more than one epitope. The ELISA reaction requires binding of 
an antibody that may or may not be covalently linked to a group that generates enzymatic- 

25 derived colorimetric product or elicits fluorescence. These end-products of an ELISA are 
used for quantitation. 

ELISA methods have several shortcomings. These assays are indirect in the sense hat 
they require multiple steps to produce a product that is quantifiable. In addition, the 
occurrence of false positives and negatives is not uncommon. Thus, they may have 
30 insufficient sensitivity for commercial diagnoses, particularly in cases where there is 
significant risk of legal liability in the event of an incorrect result. Further, ELISA-based 
assays are limited in that they can only detect analytes for which antibodies have been raised. 
This requires prior knowledge of sample composition coupled with time-consuming effort in 
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order to prepare sufficient purified protein and raise a new antibody for each target protein or 
peptide species. All of these factors generally prevent EUSA-based assays from being 
applied to identify previously unknown species or. variant derivatives of a known species 
within a sample. Thus, ELISA systems are unable to detect subtle changes to a target protein 
5 that may have a dramatic effect on its physical and biological properties. For example, the 
antibody might not recognize a specific form of the protein or peptide that has been altered by 
post-translation modification such as phosphorylation or glycosylation, or conformationally 
obscured, or modified by partial degradation. Identification of such modifications is vital 
because changes in the physical and biological properties of these proteins may play an 

10 important role in their enzymatic, clinical or other biological activities. Such changes can 
limit the reliability and utility of ELISA-based quantification methods. 

In the absence of appropriate antibodies, quantification is usually achieved by 
autoradiography after metabolic radiolabeling, fluorography, or the use of protein stains. 
These procedures depend on complete separation of the proteins of interest by techniques 

15 such as chromatographic separation or high-resolution two-dimensional electrophoresis 
(Boucherie et al. 1996). 

In the late 1980's two new mass spectrometries became available for the analysis of 
large biomolecules, namely, matrix-assisted laser desorption/ionization (MALDI) time-of- 
flight mass spectrometry (TOF MS) and electrospiray ionization (ESI). Requiring only a 

20 minute sample, mass spectrometry provides extremely detailed information about the 
molecules being analyzed, including high mass accuracy, and is easily automated. Both of 
these instruments are capable of mass analyzing biomolecules in complex biological 
solutions. MALDI-TOF MS involves laser pulses focused on a small sample plate 
comprising analyte molecules embedded in a low molecular weight, UV-absorbing matrix 

25 that enhances sample ionization. The matrix facilitates intact desorption and ionization of the 
sample. The laser pulses transfer energy to the matrix causing an ionization of the analyte 
molecules, producing a gaseous plume of intact, charged analyte. The ions generated by the 
laser pulses are accelerated to a fixed kinetic energy by a strong electric field and then pass 
through an electric field-free region in a vacuum in which the ions travel (drift) with a 

30 velocity corresponding to their respective mass-to-charge ratios (m/z). The lighter ions travel 
through the vacuum region faster than the heavier ions thereby causing a separation. At the 
end of the electric field-free region, the ions collide with a detector that generates a signal as 
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each set of ions of a particular mass-to-charge ratio strikes the detector. Travel time is 
proportional to the square root of the mass as defined by the following equation t = 
(m/(2KE)z)l/2 where t = travel time, s = travel distance, m = mass, KE = kinetic energy, and 
z = number of charges on an ion. A calibration procedure using a reference standard of 

.--5 known mass can be used to establish an accurate relationship between flight time and the 
mass-to-charge ratio of the ion. Ions generated by MALDI exhibit a broad energy spread 
after acceleration in a stationary electric field. Forming ions in a field-free region, and then 
applying a high voltage pulse after a predetermined time delay (e.g. "delayed extraction™' 1 ) 
to accelerate the ions can minimize this energy spread, which improves resolution and mass 

iLO accuracy. . . 

I*- ■ 

In a given assay, 50 to 100 mass spectra resulting from individual laser pulses are 

summed together to make a single composite mass spectrum with an improved signako- 

noise ratio. The entire process is completed in a matter of microseconds. In an automated 

apparatus, tens to hundreds of samples can be analyzed per minute. In addition to speed, 

15 MALDI-TOF technology has many advantages, which include: 1) mass range- where the 

mass range is limited by ionization ability, 2) complete mass spectrum can be obtained from a 

single ionization event (also referred to as multiplexing or parallel detection), 3) compatibility 

, with buffers normally used in biological assays, 4) very high sensitivity; and 5) requires only 

feratomoles of sample. Thus, the performance of a mass spectrometer is measured by its 

20 sensitivity, mass resolution, and mass accuracy. 

In order for mass spectrometry to be a useful tool for detecting and quantifying 

proteins, several basic requirements need to be met. First, targeted proteins to be detected 

and quantified must be concentrated (e.g., enriched and/or fractionated) in order to minimize 

the effects of salt ions and other molecular contaminants that reduce the intensity and quality 

25 of the mass spectrometry signal to a point where either the signal is undetectable or 

unreliable, or the mass accuracy and/or resolution is below the value necessary to detect the 

target protein. Second, mass accuracy and resolution significantly degrade as the mass of the 

analyte increases. Thus, the size of the target protein or peptide must be within the range of 

the mass spectrometry device where there is the necessary mass resolution and accuracy. 

30 Third, to be able to quantify accurately, one would preferably resolve the masses of the 

peptides by at least six Daltons to increase quality assurance and to prevent ambiguities. 
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Fourth, the mass spectrometric methods for protein , detection and quantification diagnostic 
screening must be efficient and cost effective in order to screen a large number of samples in 
as few steps as possible. 

Mass spectrometry methods for the quantitation of proteins in complex mixtures have 
5 employed a system using protein reactive reagents comprised of three moieties that are linked 
covalently; an amino acid reactive group, an affinity group and an isotopically tagged linker 
group (Aebersold et al, 2000). This class of new chemical reagents is referred to as Isotope- 
Coded Affinity Tags (ICATs) (Gygi et al 1999). The reactive group embodied used 
sulfhydryl groups that react specifically with the amino acid cysteine. The presence of the 
10 affinity group facilitates the isolation of the specifically labeled proteins or peptides from a 
complex protein mixture. Selected affinity groups include strepavidin or avidin. Only those 

proteins containing these affinity groups may be isolated. The linker moiety may be 

3 13 15 17 18 34 
isotopically labeled by a variety of isotopes that include H, C, N, O, O and S. 

The use of differential isotopic ICATs provides a method for the quantitation of the relative 

15 concentration of peptides in different samples by mass spectrometry. The methods can be 

used to generate global protein expression profiles in cells and tissues exposed to a variety of 

conditions. 

In an analogous method, the N-terminal amino acids of proteins from two states are 
differentially labeled using different isotopically tagged nicotinyl-N-hydroxysuccinimide 

20 reagents (Munchbach et al, 2000). Unlike the ICAT system, proteins are first separated by 
two-dimensional SDS polyacrylamide gel electrophoresis before the analysis is performed. 
The ratio of the isotope for each protein determined by mass spectrometry provides a relative 
concentration of each protein present in different physiological states. 

It is believed that the limitations of mass spectrometry methods employing either 

25 ICATs or N-succinylation isotopic tagging are inherently associated with the requirement that 
the protein from one sample is quantified relative to another state or sample rather than being 
quantified in absolute amounts. In the case of the ICAT method, it is a requirement that the 
protein or peptide being quantified contains at least one amino acid that is modified by the 
reactive group. A related requirement is that the reactive amino acid site on the protein in the 

30 two or more states or samples must be equivalently accessible to the reactive group on the 
ICATs. Similar to antibody methods, if the site is altered or conformational^ obscured then 
the quantitation of the protein will be compromised. An additional limitation in the use of N- 
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succinylation of proteins is that it requires the laborious task of two-dimensional SDS 
polyacrylamide gel electrophoresis prior to analysis. 

There remains a pressing need for easier, more reliable means to rapidly detect, 
quantify and characterize specific proteins from biologically complex samples. 
5 SUMMARY OF THE INVENTION 

The present invention provides a method for the detection and direct quantification of 
a targeted protein in complex biological samples utilizing mass spectrometry (MS). 

A first aspect of the present invention is directed to a method of detecting a target 
protein in a complex biological sample (e.g., a product of biological origin), comprising: 
10 (a) obtaining a biological sample; 

(b) extracting proteins from the sample; 

(c) concentrating the target protein from extracted proteins of (b); 

(d) digesting concentrated proteins of (c) to produce peptide fragments of the proteins, 
wherein the fragments include at least one signature peptide specific to the target protein; and 

15 (e) analyzying the peptide fragments via mass spectrometry, wherein detection of the at least 
one signature peptide is indicative of presence of target protein in the sample. 

In some embodiments, the target protein is quantified by calculating the amount of the 
. protein in the sample by comparing mass spectrometry signals generated from the signature 
peptide with mass spectrometry signals generated by its corresponding internal standard 

20 peptide. In preferred embodiments, the digesting produces peptide fragments containing at 
least two signature peptides and the analyzing comprises comparing mass spectrometry 
signals generated from at least one signature peptide with (mass spectrometry signals 
generated by an) its corresponding internal standard peptide. Thus, in embodiments where a 
plurality of signature peptides is . generated, quantification of target protein in the sample 

25 entails comparing mass spectrometry signals generated from at least one signature peptide 
with those of (i.e., mass spectrometry signals generated by an) its corresponding internal 
standard peptide. The analyzing may also involve mass spectrometry signals generated from 
any of the other signature peptides without comparison to their corresponding internal 
standard peptides, simply for confirmation or to increase confidence level. 

30 Another aspect of the present invention is directed to a method of quantifying a target 

protein which is an expression product of a transgene contained in a transgenic plant, tissue or 
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cell, or for quantifying the target protein in a product derived from the transgenic plant, 
comprising: 

(a) obtaining a sample from the transgenic plant or the product; 

(b) extracting proteins from the sample; 

5 (c) concentrating the target protein from extracted proteins of (b); 

(d) digesting concentrated proteins of (c) (e.g., with a protease) to produce peptide 
fragments of the proteins, wherein the fragments include, at least one signature peptide 
specific to the target protein; 

(e) analyzying via mass spectrometry the peptide fragments and for at least one 
1 0 signature peptide, an internal standard peptide; and 

(f) calculating amount of the target protein in the sample by comparing mass 
spectrometry signals generated from the signature peptide with mass spectrometry 
signals generated by the internal standard peptide. 

Yet another aspect of the present invention is directed to a method of quantifying a 
.15 target protein which is an expression product of a transgene contained in a transgenic animal, 
tissue or cell, or for quantifying the target protein in a product derived from the transgenic 
animal, comprising: 

(a) obtaining a sample from the transgenic animal or the product; 

(b) extracting proteins from the sample; 

20 (c) concentrating the target protein from extracted proteins of (b); 

(d) digesting concentrated proteins of (c) (e.g., with a protease) to produce peptide 
fragments of the proteins, wherein the fragments include at least one signature peptide 
specific to the target protein; 

(e) analyzying via mass spectrometry the peptide fragments and for at least one 
25 signature peptide, an internal standard peptide; and 

(f) calculating amount of the target protein in the sample by comparing mass 
spectrometry signals generated from the signature peptide with mass spectrometry signals 
generated by the internal standard peptide. 

A further aspect of the present invention is directed to a method of quantifying a target 
30 protein which is an expression product of a transgene contained in a transgenic 

microorganism, or for quantifying the target protein in a product derived from the transgenic 
microorganism, comprising: 
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(a) obtaining a sample from the transgenic microorganism or the product; 

(b) extracting proteins from the sample; 

(c) concentrating the target protein from extracted proteins of (b); 

(d) digesting concentrated proteins of (c) (e.g., with a protease) to produce peptide 
fragments of the proteins, wherein the fragments include at least one signature peptide 
specific to the target protein; 

(e) analyzying via mass spectrometry the peptide fragments and for at least one 
signature peptide, an internal standard peptide* an internal standard; and 

(f) calculating amount of the target protein in the sample by comparing mass 
spectrometry signals generated from the signature peptide with mass spectrometry 
signals generated by the internal standard peptide. In preferred embodiments for each 
of these aspects, the digesting produces peptide fragments containing at least two 
signature peptides and the analyzing comprises comparing mass spectrometry signals 
generated from at least one signature peptide with mass spectrometry signals 
generated by an internal standard peptide for at least one of the signature peptides. 
The methods of the present invention entail the digestion of the target protein so as to 
produce at least one signature peptide that is specific to the protein and perforce, not present 
in other proteins that may be present in the sample. Th^ also entail the use of one or more 
calibrants or internal standard peptides having similar structure and molecular weight to the 
signature peptide, usually in a predetermined concentration. To quantify the targeted protein, 
the mass spectrometry signals generated from the signature peptide(s) and the internal 
standard peptide(s) are compared. The mass spectrometry analysis of the internal standard 
peptide or calibrant may be performed simultaneously with or separately from the biological 
sample. 

The methods herein can be used for a variety of analyses in a wide variety of living 
organisms and in products derived therefrom. For instance, the methods may be applied to 
agricultural diagnostic analyses to detect the presence or absence of a targeted protein or set 
of targeted proteins in different types of tissue (e.g., seed, leaf, root, fruit) or in specific cell 
types or cell lines. The method is particularly useful in the detection of genetically modified 
organisms (GMOs) where a protein is being expressed, for example, for insect or herbicidal 
resistance, an enzyme or enzymes expressed to modify a metabolic pathway, and/or proteins 
are being expressed in plants for small and large-scale production. In addition, the method is 
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suitable to diagnose phytopathogenic disease or contamination in plants, soils, liquids, solids, 
and other samples from environmental sources. The present invention can also monitor for 
targeted proteins in manufacturing food processes. Monitoring of microbial (e.g. bacterial, 
fungal, etc.) populations in bioremediation sites e.g., where a targeted protein or proteins are 
5 specific for the detection and/or quantification in plants, soils, liquids, solids, and other 
samples from the environmental sources, may also be accomplished using the present 
invention, the methods herein can be used for a variety of clinical and diagnostic analyses to 
detect the presence or absence, and levels of a protein or proteins in a biological fluid (e.g. 
urine, blood, serum, saliva, etc.) or in specific cell types or tissues in humans and animals. A 

10 further aspect of the present invention is the monitoring of infections (e.g. bacterial, fungal, 
viral, etc.) and their treatment in humans and animals. 

The present ivention provides several advantages compared to current methods based 
on ELISA and mass spectrometry. One advantage of the present invention compared to 
ELISA is that it provides for a method for the direct quantitative measurement of the specific 

15 protein or peptide of interest in broad applications. Quantification of detected species can be 
completed simply using either internal or external calibrants that are modified synthetic 
peptides whose amino acid sequence is identical or nearly identical to the (signature 
fragments) of the protein(s) or peptide(s) to be quantified. The direct quantification is more 
efficient than current ELISA methods which require the lengthy and laborious production, 

20 preparation and maintenance of a uniform antibody for kits. In addition, it does not suffer 
from false positive and negatives as a result of indirect measurement. Unlike ELISA, the 
present methods can also be used to quantify proteins which may be posttranslationally 
modified or processed. For example, one internal standard could be used to quantify the 
targeted protein, while a second internal standard for the same protein could be used to 

25 quantify a post-translational peptide or processed protein. The peptides or proteins from the 
different samples can be selectively quantified by mass spectrometry. 

The present method for overcomes limitations associated with mass spectrometry 
sytems that quantify only relative concentrations by ICATs, N-succinylation or other mass 
spectrometry methods that involve chemical modification of the parent protein because it 

30 does not require the presence of specific amino acids that are accessible to reactive agents, or 
a second protein mixture for quantification. The present invention provides for multiple, 
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simultaneous, independent analyses of the protein of interest, fliereby significantly increasing 
the reliability of the diagnostic results obtained. 

Beyond the foregoing, the invention described can be employed for the quantitative 
analysis of targeted proteins in complex biological mixtures as well as for simultaneous 
-5 quantitative and quantitative analysis of multiple targeted proteins within a complex 
biological sample. Mass spectrometry is often the detection system of choice for structural 
characterization of analytes within complex biological mixtures due to its ability to 
specifically detect a wide range of analyte types. This makes MS ideal for the identification 
of unknown species during de novo research. Finally, MS analysis affords rapid and sensitive 
10 detection of a wide range of compounds at the low femtomole to high attomole range. The 
methods and reagents described herein can be employed in clinical and diagnostic assays for 
monitoring and quantifying targeted proteins from complex biological samples. 
BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a table that shows predicted differential protease cleavage peptides of 5- 
15 enolpyruvylshikimate-3-phosphate synthase (EPSPS). 

Fig. 2 is a photograph of a two dimensional gel electrophoresis of soybean seed 
proteins. 

Fig. 3 A is a graph showing a tryptic peptide mass fingerprint of EPSPS excised from 
SDS-PAGE, and Fig. 3B is an amino acid sequence of EPSPS wherein underlined amino 
20 acids were detected by mass spectrometer. 

Fig. 4 is a photograph of an SDS-PAGE Commassie stained gel with different lanes of 
soybean leaf and seed tissue with the absence and presence of EPSPS, wherein the arrow (-►) 
* indicates location of EPSPS. 

Figs. 5A and B are graphs of calibration curves for (A) synthetic modified peptide 
125 MP- 1 and (B) synthetic modified peptide MP-2. 

Figs. 6A and B are graphs of tryptic peptide mass fingerprint of EPSPS with spiked 
calibrants MP-1 and MP-2, wherein Fig. 6B shows calibrants with parent peaks, and wherein 
arrows (j) indicate calibrants MP-1 = 1401.2 m/z and MP-2 = 1688.3. 
BEST MODE OF CARRYING OUT INVENTION 
30 Sample Types 
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The present invention provides mass spectrometry processes , for detecting and 
quantifying a targeted protein or proteins in a biological sample. For use in the invention, the 
biological sample should contain at least one protein. 

As used herein, the term "biological sample" refers to any material directly or 
5 indirectly derived from any living source (e.g. plant, human, animal, microorganism such as 
fungi, bacteria, virus). Examples of appropriate biological samples for use. in the invention 
include: tissue homdgenates (e.g. biopsies), cell homogenates; cell fractions; biological fluids 
(e.g. urine, serum, cerebrospinal fluid, blood, saliva, amniotic fluid, mouth wash); and 
mixtures of biological molecules including proteins, DNA, and metabolites. The term also 
10 includes products of biological origin including pharmaceuticals, nutraceuticals, cosmetics, 
and blood coagulation factors, or the portion(s) thereof that, are of biological origin e.g., 
obtained from a plant, animal or microorganism. 

Any source of protein in a purified or non-purified form can be utilized as starting 
material, provided it contains or is suspected of containing the protein of interest. Thus, the 
15 target protein of interest may be obtained from any source, which can be present in a 
heterogeneous biological sample. The sample can come from a variety of sources. For 
example: 1) in agricultural testing the sample can be a plant, plant-pathogen, soil residue, 
, fertilizer, liquid or other agricultural product; 2) in food testing the sample can be fresh food 
or processed food (for example infant formula, fresh produce, and packaged food); 3) in 
20 environmental testing the sample can be liquid, soil, sewage treatment, sludge, and any other 
sample in the environment which is required for analysis of a particular protein target; 4) in 
pharmaceutical and clinical testing the sample can be animal or human tissue, blood, urine, 
and infectious diseases. 

As used herein "protein target" refers to a specific protein or set of proteins, which are 
25 being monitored, detected, and/or quantified in a biological sample. A protein target may be 
referred to as a protein marker, which is diagnostic for a given biological sample. The 
identity or function of the protein marker does not need to be known. 
Protein Extraction 

Protein can be isolated from a particular biological sample using any of a number of 
30 procedures, which are well known in the art, the particular isolation procedure chosen being 
appropriate for the particular biological sample. For example, soft plant tissues and animal 
tissues can be homogenized in the presence of appropriate cold buffers in a Waring Blender, 
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polytxon or sonicator; seeds may . be macerated in a coffee grinder and further homogenized 
by mortar and pestle; bacteria and cell suspensions may be extracted by a French press, 
ultrasonication, or bead mill in the presence or absence of lysozymes; and red blood cells are 
easy to extract, after collection by centrifugation, and a rinsing step in isotonic NaCl, the cells 
■«* 5 are osmotically lysed with water or sonicated (Current Protocols in Protein Biochemistry, 
Cold Spring Harbor). 
Concentration of Target Protein 

Concentration of the target protein is desirable in order to obtain an appropriate 
quantity of a specific protein target on which to perform mass spectrometry. Concentration 
210 (e.g., enrichment) may be accomplished by any number of techniques and methods. 
i* Examples of appropriate means for concentration include the use of solid support resins (e.g. 
ion exchangers, affinity gel, and other resins that adsorb proteins). The resins may include 
beads (e.g. silica gel, controlled pore glass, Sephadex/Sepharose, cellulose, agarose,), that can 
be placed in columns (chromatography, capillary tubes), membranes or microtiter plates 
1 5 v . (nitrocellulose, polyvinylidenedifluoride, polyethylene, polypropylene), or on flat surfaces or 
chips or beads placed into pits in flat surfaces such as wafers (e.g. glass fiber filters, glass 
surfaces, metal surfaces (stainless steel, aluminum, silicon)). Alternatively, the beads may be 
added batchwise to protein solutions and then removed rapidly by centrifugation, filtration or 
magnetically (for magnetic beads). Such absorbents can capture or preferentailly absorb as 
20 much as 90% or more of the target protein. Other examples of concentration include gel 
electrophoresis, capillary electrophoresis, and pulsed field gel electrophoresis. The choice of 
a particular method will depend on a number of factors such as the amount of protein target 
present, the physical properties of the protein, the sensitivity required for the detection of the 
protein and the like. The method may also be designed such that the absorbent is specific to 
"25 the target protein or non-target proteins. 

In one embodiment of the present invention, a protein target is concentrated in a 
sample using polyacrylamide gels. The process involves homogenizing and extracting 
protein from a seed or leaf, adding the protein to either sodium dodecyl sulphate (SDS) 
polyacrylamide gel electrophoresis (PAGE) or to separate the proteins by isoelectric focusing 
30 followed by SDS-PAGE (referred to as 2-D gels). A protein band or protein spots can be 
excised from a gel, digested in situ by a chemical (e.g. CNBr) or proteolytic cleavage (e.g. 
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trypsin), followed by elution of the digested peptide fragments. Techniques are also available 
for the mass spectrometry analysis of whole proteins from gels. 

In a preferred embodiment of the present invention, a resin capable of adsorbing the 
target protein, such that the targeted protein is substantially (e.g., at least about 90%) 
5 dissociated from contaminating proteins, is used to enrich a protein target. A biological 
sample solution containing proteins is simultaneously enriched and filtered. The amount of 
sample that can be enriched using a given amount of resin can vary based upon the binding 
capacity of the resin. The simultaneous enriching and filtering procedure may be 
accomplished using a modified filtration technique. Filter techniques use devices such as 

10 filters and rely upon centrifugal or other driving force to wash and elute the sample through a 
structure such as a membrane. The size of the pores vary depending upon the protein target 
and biological sample. Any ultrafiltration device can be used to practice the present 
invention provided that the filter has a specific molecular weight cut-off. Such filters and 
ultrafiltration devices are commercially available from Millipore Corp., Bedford, Mass., or 

1 5 LifeScience Purification Technologies, Acton, Mass. 

In accordance with various embodiments of the present invention, resin may be placed 
in a> filtration device, for example, using the wells of a microtiter plate. The resin can be 
added to the microtiter plate in the form of beads. In this embodiment, tie resin is added to 
microtiter wells, which contain a membrane at the bottom of the well through which the 

20 sample is allowed to be washed and eluted through the container into a receptacle. The 
biological sample solution is added to the microtiter plate containing the resin. The sample 
interacts with the resin and ions in the sample solution are exchanged for ions on the resin. 
Upon centrifugation or vacuum filtration, the protein targets absorbed to the resin may be 
washed or eluted off the resin and ftrough the membrane filter. The enriched protein target is 

25 then collected from the receptacle. 
Protease Digestion of Target Proteins 

The enriched proteins are typically cleaved by a specific protease to generate peptide 
fragments. There are many different types of proteases one skilled in the art could use for 
cleaving proteins. For example: endoproteinase-Arg-C, endoproteinase-Aspn-N, 

30 endoproteinase-Glu-C (V8), endoproteinase-Lys-C, Factor Xa, papain, pepsin, thermolysin, 
and trypsin, are examples of endopeptidases. It is also possible to use a chemical compound, 
which may cleave at specific amino acids (e.g. CNBr, cleaves at methionines). Proteases and 
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chemicals generate different peptide fragment' lengths and thus different peptide masses. As 
illustrated in Figure 1, for example, the glyphosate resistant gene, 5-enolpyravylshi]rimate-3- 
phosphate (EPSPS), is differentially cleaved into numerous peptides with distinct mass sizes. 
Two or more proteases may be used to enhance the production of desired peptides either 

* 5 sequentially or concurrently. The peptides are preferably in the range from about 900 to 2500 
> Da but are not limited to these molecular sizes. 

^ Selection of Signature Diagnostic Peptides 
Sequence-Dependent Proteins 

The targeted protein can be digested e.g., cleaved by a specific protease, to generate a 

* 10 family of peptide fragments that can be analyzed by mass spectrometry to generate a peptide 
£ mass fingerprint. As used herein the term "signature peptide masses" refers to the peptide 

masses generated from a particular protein target or targets, which can used to identify the 
protein target Those peptide masses from a given peptide mass fingerprint which ionize 
easily and have a high mass resolution and accuracy, are considered to be members of a set of 

15 signature diagnostic peptide masses for a given target. The pattern is is unique and thus 
distinct for each protein. A "sequence-dependent protein" is a protein for which the amino 
acid sequence is already known (e.g., from gene sequencing). One skilled in the art will 
recognize that peptide mass fingerprints generated from a protein target can be easily 
compared with predicted peptide mass fingerprints generated in silico and predicted masses 

20 of a target protein. Thus, the location of where these peptide masses reside in a given target 
protein can be determined (e.g. a peptide fragment may reside near the N-terminus or C- 
terminus of a protein). 

w " The observed peptide masses of a targeted protein can be compared within silico- 

»■ predicted masses of a targeted protein for which the amino acid sequence is known. Those 
25 peptide masses from a given peptide mass fingerprint, which ionize easily and have high mass 
resolution and accuracy are considered to be members of a signature diagnostic peptide 
masses for a given target. For a sequence-dependent protein that is not available in purified 
form for peptide mass fingerprinting, signature diagnostic peptides can be selected by 
chemically synthesizing peptides corresponding to predicted peptide fragmoits from a 
30 particular target protein/proteinase combination and then determining their physico-chemical 
properties and behavior in the mass spectrometer. During peptide synthesis, a portion of the 
peptide can be chemically modified (e.g. by acetylation) to provide internal calibrants or 
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internal self-calibrants for inclusion in the set of peptide mass fingerprints to be analyzed by 
mass spectrometry. The inclusion of such chemically modified synthetic peptides facilitates 
improved mass accuracy and quantitation. 
Sequence-Independent Proteins 
5 The present invention also includes the identification of signature diagnostic peptide 

masses of a sequence-independent protein target. A "sequence-independent" protein is a 
protein where the amino acid sequence is not known initially, however the protein could be a 
known marker for a specific physiological or genetic state. Sequence-independent proteins 
can be purified by standard proteomics techniques, such as two-dimensional gel 

10 electrophoresis, and cleaved by a specific protease to generate a peptide mass fingerprint 
determined by mass spectrometry. Peptides that are not well represented in the {peptide mass 
fingerprint would be excluded from the set of potential signature diagnostic peptides. One 
skilled in the art will recognize that the order of these peptides in the progenitor protein may 
not be known. Amino acid sequencing may be accomplished by several means, a chemical 

15 method Edman degradation or by post-source decay (PSD) analysis on a mass spectrometry 
instrument. PSD analysis is an acquisition method tailored to the analysis of fragment ions. 
PSD mode allows for better analysis- of ion fragments in the flight tube by optimizing the 
mirror ratio and precursor ion selector. The choice of measurement method depends in part 
on the purity of the protein and the quantity of the protein to be analyzed. 

20 Detection and Quantitation of Peptides by Mass Spectrometry 

Once a set of signature diagnostic peptide masses have been identified either from a 
sequence-dependent or sequence-independent protein target, it is possible to detect or 
determine the absolute amount of the targeted protein in a complex mixture by synthesizing 
the internal standards in the mass spectrometer. For quantification, a known amount of 

25 internal standards, at least one such peptide and in preferred embodiments, two for each 
specific protein in the mixture to be detected or quantified, are added to the sample to be 
analyzed. In particular, the internal standard peptides are identical to peptides which have 
originated from the targeted protein, except that they are modified to distinguish them from 
the mass of the equivalent signature peptide to be quantified. Qualitative analysis does not 

30 require presence of the internal standard peptide. 

In an embodiment of the present invention the use of internal standard-peptides are 
used not only to detect but also determine the absolute amount of the target protein or 
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proteins in a complex mixture. These internal standard peptides are of particular use to 
monitor and quantify the target protein. In this application, the internal standard peptide is 
chemically identical to a peptide fragment determined from a signature diagnostic peptide 
mass fingerprint, except that the peptide has been, modified in such a way that there is a 
5 distinct mass difference compared to the parent mass that allows it to be independently 
detected by MS techniques. One skilled in the art can synthesize the amino acid sequence 
^ ' and modify a specific amino acid to distinguish the peptide from the parent peptide. For 
example, peptides can be modified by acetylation, amidation, anilide, phosphorylation, or by 
substituting one or more atoms of one or more amino acids with a stable isotope to generate 
f *-10 one or more substantially chemically identical, but isotopically distinguishable internal 
standard peptides. For example, any hydrogen, carbon, nitrogen, oxygen, or sulfur atoms may 
be replaced with isotopically stable isotopes: 2 H, 13 C, 15 N, 17 0, or 3 ^S. The internal 
standard peptides can be used in the method described herein to quantify one or several 
protein targets in a biological sample. 
15 In one aspect, the invention provides a mass spectrometric method for detection and 

quantification of one or more proteins from a complex biological solution, which employs a 
resin in which the resin can separate or absorb targeted proteins based upon the properties of 
the targeted protein. Such that the targeted protein will either absorb to the resin or 
contaminating proteins will absorb to the resin. It may be necessary to wash the resin to 
20 remove contaminating proteins and thus reducing the complexity of the biological solution. 
Following a wash step the targeted protein or proteins may be eluted with specific buffers to 
dissociate the protein. After the proteins have been eluted, the proteins are digested e.g., with 
$ a specific protease to generate peptide masses, which can be analyzed by mass spectrometry 
■ techniques. Signature diagnostic peptide masses will provide the information as to wheher 
"-25 or not the targeted protein or proteins have been detected in the complex biological solution. 
The proteolytic step may not be necessary, if the targeted proteins can be detected directly by 
the mass spectrometer with sufficient accuracy to avoid confusion with other non-target 
proteins. 

Quantification of targeted proteins in one or more different samples containing protein 
30 mixtures (e.g. biological fluids, cell or tissue lysates etc.) can be determined using identical 
peptides based upon in silico proteolytic digests of targeted proteins, which have been 
modified as to change the mass. The amounts of a given targeted protein in each sample is 
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determined by comparing the abundance of the modified peptides from any modified peptide 
originating from that protein. The method can be used to quantify amounts of known proteins 
in different samples. More specifically, the method can be applied to screen for and identify 
proteins^ which exhibit differential expression in cells, tissue or biological fluids. It is thus 
5 possible to determine the absolute amounts of specific proteins in a complex mixture. In this 
case, a known amount of internal standard, at least one for each specific protein in the 
mixture to be quantified is added to the sample to be analy2ed. The internal standard is 
identical to the peptide from which it originated except that it is modified to distinguish it 
from the signature peptide mass to be quantified. The internal standard can be provided in the 
10 sample to be analyzed in other ways. For example, a specific protein or set of proteins can be 
monitored by multiple internal peptides to detect and quantify several targeted proteins 
simultaneously. 

Accurate quantification of the target protein is achieved through the use of 
synthetically modified peptides that have amino acid identity, or near identity, to signature 

15 diagnostic peptides and have been predetermined for molecular weight and mass. The typical 
quantification analysis is based on two or more signature diagnostic peptides that are 
measured to reduce statistical variation, provide internal checks for experimental errors* and 
provide for detection of post-translation modifications. The method of this invention can be 
used for qualitative and quantitative analysis of single or multiple targeted proteins in 

20 complex biological samples for a variety of applications that include agricultural, food 
monitoring, pharmaceutical, clinical, production monitoring, quality assurance and quality 
control, and the analysis of environmental samples. 

To facilitate mass spectrometric analysis, peptides and proteins generated from either 
"in-gel" proteolysis or from biological solutions may be concentrated, desalted, and 

25 detergents removed from peptide or protein samples by using a solid support. Examples of 
appropriate solid supports include Ci8 and C4 reversed-phase media, ZipTip (Millipore). 
Immobilization of peptides or proteins can be accomplished, for example, by passing peptides 
and proteins through the reversed-phase media the peptides and proteins will be adsorbed to 
the media. The solid support-bound peptides or proteins can be washed and then eluted, 

30 which improves overall detection by mass spectrometry. 

Preferred mass spectrometer formats for use in the invention are matrix assisted laser 
desorption ionization (MALDI) and electrospray ionization (ESI). For ESI, the samples, 
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dissolved in water or in a volatile buffer, are injected either continuously or discontinuously 
into an atmospheric pressure ionization interface (API) and then mass analyzed by a 
quadrupole. The generation of multiple-charged ion peaks, which can be obtained using ESI 
mass spectrometry, can increase the accuracy of the mass determination. Even more detailed 

^ 5 .. information on the specific structure can be obtained using an MS/MS quadrupole 
configuration. For a more detailed discussion and comparison of various mass spectrometry 

v/: technologies, see Siuzdak, Mass Spectrometry for Biotechnology, Academic Press (1995). 

In MALDI mass spectrometry, various mass analyzers can be used, e.g., magnetic 
sector/magnetic deflection instruments in single or triple quadrupole mode (MS/MS), Fourier 

v-10 transform and time-of-flight (TOF) configurations as is known in the art of mass 

* spectrometry. For the desorption/ionization process, numerous matrix/laser combinations can 
be used. Ion trap and reflectron configurations can also be employed. 

Mass spectrometers are typically calibrated using analytes of known mass. A mass 
spectrometer can then analyze an analyte of unknown mass with an asspckted mass accuracy 

15 and precision. However, the calibration, and associated mass accuracy and precision, for a 
given mass spectrometry system can be significantly improved if analytes of known mass 
(e.g., molecular weight) are contained within the sample containing the analyte(s) of 
unknown mass(es). The inclusion of these known mass analytes within the sample is referred 
to as use of internal calibrants. The preferred practice is to add known quantities of the 

20 calibrant. For MALDI-TOF MS, generally only two calibrant molecules are needed for 
complete calibration, although sometimes three or more calibrants are used. All of the 
embodiment of the invention described herein can be performed with the use of internal 

* calibrants to provide improved mass accuracy. 

One skilled in the art will readily recognize that the method described in this 
*25 invention has many advantages. It can be readily modified for automated detection and 
quantification of target proteins. In one embodiment of the present invention is a machine for 
processing the sample, enriching the protein target, cleaving the protein target, and 
transferring the peptides to mass spectrometry for detection and quantification of the peptide 
masses, and a computer means for recording and outputting the results of the MS spectra. 
30 Another embodiment is a kit for the detection of a specific target protein in specific sample 
types, which provides the user with reagents that have been customized for a particular target 
protein. Thus, in preferred embodiments, the kit contains extraction buffer(s), enrichment 
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resin(s), protease(s), synthetic diagnostic peptide(s) and internal standard peptide(s), and 
precise instructions on their use. 

The methods, instruments and procedures described herein can be used for a variety of 
purposes. Because of the sensitivity and specificity of the analysis one skilled in the art will 
5 readily recognize uses for this methodology. What follows is a representative list of uses in 
specific areas where a current need exists for a quick and reliable analysis. 

The methodology of the present invention is usefiil in the areas of agriculture. One 
can monitor genetically modified organisms (GMO's) in all agricultural crops where a protein 
is being transgenically expressed. Detecting or quantifying the target proteins that are 

10 specific to a genetically modified organism is important for diagnosing or monitoring levels 
of such proteins and determining whether a product or sample is "GMO-firee" Examples of 
genetically modified organisms, which may be detected and quantified by the disclosed 
processes include: glyphosate tolerant (e.g. corn, beet, rapeseed, soybean, cotton), 
phosphinothricin tolerant (e.g. rapeseed, beet, corn, soybean, rice), bromoxynil tolerant (e.g. 

15 cotton), lepidopteran resistant (e.g. com, tomato, cotton), Colorado potato beetle resistant 
(e.g. potato), PLRV resistant (e.g. potato), PVY resistant (e.g. potato), male sterile (e.g. com), 
com root worm resistant (e.g. com), CPB resistant (e.g. potato), modified oil profife (e.g. 
rapeseed, soybean), European core borer resistant (e.g. corn), sulfonylurea tolerant (e.g. 
cotton), and potato beetle resistant (e.g. potato). 

20 This procedure is very useful in the area of monitoring plant pathogens. One can 

monitor plants that have been infected by any number of different pathogens. Examples of 
plant pathogens which could be monitored include: viruses (e.g. African cassava mosaic 
virus, alfalfa mosaic virus, apple chlorotic leaf spot virus, apple mosaic virus, banana bunchy 
top virus* banana streak virus, barley stripe mosaic virus, barley yellow dwarf virus, 

25 cauliflower mosaic virus, cucumber mosaic virus, maize streak virus, papaya mosaic virus, 
potato vims Y, soybean mosaic virus, tobacco mosaic virus, wheat streak mosaic virus); 
viroids (e.g. chrysanthemum stunt viroid, potato spindle tuber viroid); fungi (e.g. 
phytophthora, verticillium daliliae); and bacteria (e.g. clavibacter michiganensis subs. 
Michiganensis, erwinia amylovora, erwinia carotovora, envinia chrysantheini, erwinia 

30 stewartiUpseudomonas and xanthomonas). 
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This procedure is very useful in the area of monitoring food contamination. Food 
producers currently test their products for specific contamination (e.g. microorganisms, 
genetically modified organisms). The present invention will facilitate this testing. 

Another application of Ihis method is in the manufacturing process. A number of 
" 5 manufacturing processes for fermentation processes, food manufacturing, chemical 
manufacturing, and drugs rely on the detection and quantification of protein markers. In 
either case the method of the present invention can be used. It can also be used for quality 
control in monitoring protein levels in laboratory tests. 

The following examples are offered by way of example, and are not intended to limit 
10 the scope of the invention in any manner. 

Example 1. Predicted proteolytic cleaveage of 5-enolpyruvylshikimate-3-phosphate synthase 
present in genetically modified plants. 

For those proteins that correspond to genes that have been cloned and sequenced, 
such as for 5^nolpyravylshikiinate-3-phosphate synthase (EPSPS), its amino acid sequence 
15 can be readily deduced from the DNA sequence. The deduced amino acid sequence of 
EPSPS was subjected to theoretical or in silico protease digestion using 5 distinct proteases. 
Three of the proteases cleave at basic amino acid sites (trypsin, Arg-C, Lys-C) and the two 
other proteases at acidic sites (Asp-N and Glu-C). Arg-C and Lys-C cleave at the carboxy- 
terminus of arginine and lysine residues, respectively. Trypsin cleaves at two amino acid 
20 sites, lysine and arginine residues. Asp-N cleaves at amino-terminus of aspartic residues 
while Glu-C cleaves at the carboxy-terminus of glutamic residues. 

In Figure 1, the peptides cleaved by the five proteases are listed from the amino- 
terminus to the carboxy-terminus. The preferred peptides masses range from 900 to 2500 
daltons in molecular size. As illustrated by trypsin cleavage (two sites), in the event that 
k 25 single amino cleavage does provide a sufficient number of peptides, a second protease may 
be used to increase the number of preferred peptide molecular sizes. The additional number 
of peptides generated by two-site cleavage provided an increased and better distribution of 
peptides throughout the entire protein. 

Example 2. Enrichment of 5-:enolpyruvylshikimate-3-phosphate synthase EPSPS from 
30 soybean seeds 

Enrichment of 5-enolpyruvylshikimate-3-phosphate synthase from transgenic soybean seeds. 
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The initial step for the detection and quantitation of a GM protein in plants is to obtain 
a protease digest profile of peptides for the protein of interest. The protein may be available 
in large amounts from a recombinant source such as bacteria or insect (baculovirus expressed) 
cells. Alternatively, transgenic plants expressing sufficient levels of the protein may provide 
5 the material required for peptide profiling. For EPSPS, the latter source was used as the 
source for peptide profiling. 

The plant material was subjected to a series of enrichment protocols for the 
identification of EPSPS and was preliminarily identified when comparing colloidal stained 2- 
D gel patterns of proteins isolated from non-transformed and transgenic soybean seeds 

10 expressing EPSPS. 

Ten seeds were ground in a pre-chilled coffee grinder to a fine powder. The powder 
was transferred to a mortar and homogenized with the addition of 6 mL of ice cold extraction 
buffer containing 100 mM Tris HC1, pH 7.8, 5 mM magnesium chloride, 100 mM sodium 
borate, 0.2% sodium ascorbate, and 0.05% w/v Tween 80. The slurry was transferred to a 2 

15 mL Kontes glass tissue grinder for further homogenization. The homogenate was transferred 
to a 1.5 mL microfuge tube, sonicated for 5 min., and chilled on ice for 10 minutes. This 
procedure was: repeated twice. Insoluble cellular debris was removed by centrifuging the 
homogenate for 10 min. at 14,000 g. The supernatant was decanted into a clean 1.5 mL 
microfuge tube. The Bio-Rad Protein Assay kit using a bovine gamma globulin standard was 

20 used to determine the protein concentration of the supernatant. 

The resulting soluble proteins were next characterized by high-resolution two 
dimensional gel electrophoresis. Proteins (420 jug) were solubilized for isoelectric focusing 
(IEF) analysis in rehydration sample buffer consisting of 5 M urea, 2 M thiourea, 2% (w/v) 
CHAPS, 2% (w/v) SB 3-10, 40 mM Tris, 2 mM tributyl phosphine (added to rehydration 

25 solution just before use), and 0.2% Bio-Lyte 3/10. Protein/rehydration solution was 
rehydrated into 11 cm IPG ReadyStrip pH 3-10 under passive conditions: 0 volts, 20°C, 16 
hrs. 

One-dimensional isoelectric focusing was carried out on a Protean IEF cell (Bio-Rad) 
for 35,000 volt-hours using IPG ReadyS trips (Bio-Rad). Following first dimension 
30 electrophoresis, gels were equilibrated for 20 minutes in a buffer containing 20% glycerol, 
0.375 M Tris, 6 M urea, 2% SDS, and 5 M tributyl phosphine. PG ReadyStrips were placed 
on top of a Criterion™ precast 1 mm 4-20% gradient Tris-HCl-SDS gel (Bio-Rad) and 0.5% 
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warm Agarose containing 0.01% bromophenol blued was added to the remaining well. 
Electrophoresis was carried out on a Criterion mini electrophoresis cell (Bio-RAD) at room 
temperature. The electrophoresis running buffer was prepared from a 10X Tris-glycine-SDS 
solution (Bio-RAD). Following assembly of the gel system .and addition of the running 
buffer, the electrophoresis was carried out at an initial current of 2 mA, 3500 volts, 45 watts, 
for 1.5 hrs. The current was ramped up to 5 mA for 30 minutes followed by 10 mA for 2-3 
hfs. Typical run times were between 4-5 hrs. Following electrophoresis, gels were stained in 
a buffer consisting of 17% ammonium sulfate, 30% methanol, 3% phosphoric acid, and 0.1% 
coomassie brillant blue G250, for at least 12 hrs. Gels were rinsed with water and stored in 
2% acetic acid until further processing. 

Colloidal Coomassie-stained gel images were captured using an EPSON 1200 photo 
imager. Digital filtering algorithms were used to remove non-uniform background without 
removing critical image data. Internal standards 2-D gel standards were used initially to 
precisely determine the molecular weight and isoelectric point of targeted proteins of intereS. 
5-enolpymvylsWlrimate-3 ^phosphate synthase preliminary identification by 2-D gel 
differential display. 

As described previously, 2-D gel electrophoresis displays were obtained by separating 
and resolving proteins isolated from either non-transformed or transgenic soybean seeds 
expressing EPSPS. Fig. 1 shows a typical protein pattern by transgenic soybean seeds 
expressing EPSPS. Approximately 50 proteins were resolved per gel in this specific 
quadrant. Most of the proteins observed were always present in different pools of 10 seeds 
analyzed. 

Fig. 2 illustrates a 2-D gel display of EPSPS expressed transgenic soybean seed 
proteins. EPSPS is the only protein which was differentially present in the transgenic 
material, but not in the non-transformed soybean seed differential display. The remaining 
proteins were present in both 2-D gels. The one protein having a molecular weight of 
approximately 48 kDa and a pi of 5.2 is unique to the transgenic soybean seed sample 
(labeled in Fig. 2). The protein labeled EP-1 was reproducibly detected in all samples 
analyzed which had been previously identified as being transgenic for EPSPS. Table 1 
identifies proteins, designated either as FLB-1 through FLB-7 or EP-1, by their approximate 
molecular weight and isoelectric point. Both the molecular weight and isoelectric point 
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values listed in Table 1 are approximate and accurate to within 3,000 Daltons for molecular 
weight and to within 0.2 pi units for isoelectric point. 

Table 1. Related and un-related proteins from non-transformed and transgenic soybean seed 
lines expressing EPSPS. 



Protein ID 


MW(kDa) 


Pi 


Nori-transgenic 


Transgenic 


FLB-1 


52 


5 ; 1 


+ 


+ 


FLB-2 


40 


6.8 




+ • 


FLB-3 


42 


6.9 


+ 


+ 


FLB4 


48 


5.5 


+ 


+ 


FLB-5 


53 


5.7 


+ 


+ 


FLB-6 


53 


5.9 


+ 


+ 


FLB-7 


53 


6.1 


+ 


+ 


EP-1 


48 


5.2 




+ 



5 



Proteins FLB-1 through FLB-7 were reprodiicibly detected in all soybean seed lines 
and pools of seeds tested, these proteins had a distinct pattern and were further characterized 
to rapidly monitor the presence or absence of EP-1 relative to both the molecular weight and 
isoelectric point. EP-1 was detected only in those lines or pools of seeds, which were 

1 0 identified as having the EPSPS gene. 

Example 3. Identification of diagnostic marker peptides for 5-enol-pyruvylshikimate-3- 
phosphate synthase and beta-conglycin in soybean seeds 
Identification of EP-1 and FLB-1 to FLB-7 diagnostic protein markers. 

The eight protein spots detectable on a 2-D gel corresponding to EP-1 and FLB-1 

15 through FLB-7 were excised, proteolytically cleaved, analyzed and identified as described 
below. 

Identical protein spots from 1-3 gels were excised manually and pooled. The gel 
pieces were macerated and destained with 25 mM ammonium bicarbonate/50% acetonitrile in 
a 1.5 mL microfuge tube with vigorous shaking for 30 minutes. The bluetinted destaining 
20 solution was removed and discarded with a fine-tip pipette. The destaining step was repeated 
until the stain was removed from the gel pieces. The gel pieces were dried under vacuum for 
10 to 15 minutes. Proteins were digested overnight at 37 °C in a total volume of 25 jiL of 
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sequence-grade, modified trypsin (Roche Diagnostics) at a final protein of 25 ng/pL in 25' 
mM ammonium bicarbonate. Peptides were eluted with 50% acetonitrile and 0.5% 
trifluoroacetic acid. All peptide samples were concentrated, desalted, and detergents 
removed by using either C4 or Ci8 reversed-phase ZipTip™ pipette tips as described by the 
5 ^manufacturer (Millipore). 

The resulting tryptic peptides were analyzed directly by mass spectrometry. All mass 
"spectrometry experiments were carried out on a PerSeptive Biosystems (Framingham, MA) 
Voyager DE-STR equipped with a N2 laser (337 nm, 3-nsec pulse width, 20-Hz repetition 
rate). The mass spectra were acquired in the reflectron mode with delayed extraction. 

10 ""Internal mass calibration was performed with low-mass peptide standards, and mass- 
" measurement accuracy was typically +0.1 Da. All peptide samples were diluted in c^cyano- 
4-hydroxycinnamic acid, which had been prepared by dissolving 10 mg in 1 mL of aqueous 
50% acetonitrile containing 0.1% trifluoroacetic acid. The measured tryptic peptide masses 
were used as inputs to search the NCBInrS. 13.99 database. All searches were carried out 

15 using the Protein Prospector software developed at UCSF. No restrictions were placed on the 
species of origin of the protein. The allowed protein molecular mass range was 1,000 to 
150j000 Da. Isoelectric points were allowed to range from 3.0 to 10.0, and oxidation of Met 
was included as a side reaction. Up to one missed tryptic cleavage was considered, and a 
mass accuracy of +0.5 Da was used for all tryptic-mass searches. The number of tryptic 

20 peptides required for identification varied as a function of the quality of the MALDI mass 
spectra. 

Diagnostic protein markers for EP-1 were identified. 

Using the methodology described above* EP-1 (MW 48 kDa, pi 5.2) was isolated from 
three 2-D gels of transgenic soybean seeds encoding the EPSPS gene, digested with trypsin, 
25 and the peptides were extracted. The resulting peptide mixture was analyzed directly by 
MALDI-MS. Correlation of the determined masses with the EPSPS sequence is shown in 
Table 2. Eight diagnostic tryptic peptide masses were identified, which covers 30.4% of the 
EPSPS sequence (Figs. 3 A and 3B). 

Table 2. Peptide masses obtained by MALDI-MS of the 48 kDa, pi 5.2 EP-1 protein spot 
30 digested with trypsin. 



1 
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Observed 
M+H 


Predicte 
dM 


Sequence 


Residues 


711.2 


710.9 


VLNPLR 


133-138 


948.1 


948.1 


TPTPITYR 


161-168 


1358.9 


1359.5 


SFMFGGLASGETR 


34-46 


1645.9 


1646.8 


GLGNASGAAVATHLDHR 


389-405 


1993.7 


1994.2 


MLQGFGANLTVETDADGVR 


206-224 


2182.6 


2183.5 


TGLILTLQEMGADIEVMPR 


275-294 


2366.6 


2367.8 


SAVLLAGLl^GITTVIEPIMTR 


178-200 


3250.2 


3250.7 


APSMIDEYPILAVAAAFAEGATVMNGLEELR 


321-351 



♦underlining reflects sequences confirmed by MALDI-PSD. 

Sequence tags were obtained from EP-1 to definitively identify the protein by mass 
spectrometry. Post source decay (PSD) fragment ion spectra were acquired for two peptides 
(1358.9. and 1646.8 m/z) after isolation of the appropriate precursor ion by using timed ion 
5 selection. Fragment ions were refocused onto the final detector by stepping the voltage 
applied to the reflector in the following ratios: 1.0, 0.913, 0.842, 0.756, 0.605, 0.412, 0.274, 
0.198, and 0.121 (fragment segments). The individual segments were stitched together by 
using software provided by PerSeptive Biosystems. All precursor ion segments were 
acquired at low laser power (variable attenuator = 1,450) for <256 laser pulses to avoid 

10 saturating the detector. The laser power was increased for all of the remaining segments of 
the PSD acquisitions. Typically, 256 laser pulses were acquired for each fragment-ion 
segment. The PSD data were acquired at a digitization rate of 20 MHz. Mass calibration was 
performed with peptide standards. Metastable decompositions were measured in all PSD 
mass spectrometry experiments. The PSD analysis verified the amino acid sequence of both 

15 1358.9 and 1646.8 m/z peptide masses. 

Diagnostic protein markers FLB-1 to FLB-7 were identified. 

Peptide mass fingerprints were generated for seven distinct proteins, which were used 
as reference markers to identify the presence or absence of EP-1 (Table 3). The tryptic map 
of each protein spot contained approximately 25 resolved peaks. Mass analysis was 
20 performed on each peptide mass fingerprint to identify each protein. Searching the NCBR 
database identified two of the seven proteins. FLB-1 with an apparent molecular weight of 
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52 kDa suggests identification of the protein by 16 matches of 22 tryptic peptide masses 
(Table 4). These matches suggest the identification of a soybean beta-conglycinin, alpha 
chain precursor. 

Table 3. Peptide masses obtained by MALDI-MS of seven proteins excised from 2-D gels 
5 and digested with trypsin. 



FLB-l 
52 


FLB-2 
48 


FLB-3 
52 


LB-4 
48 


FLB-5 
42 


FLB-6 
42 


FLB-7 
38 


564:i 


568.0 


653.1 


564.2 


568.0 


545.3 


545.3 


621.1 


634.1 


659.2 


621.3 


745.3 


564.2 


564.3' 


637.0 


662.1 


745.2 


659.2 


797.2 


568.1 


6593 


784.1 


722.9 


797.1 


712.2 


897.2 


627.3 


876.9 


859.0 


804:0 


890.0 


720.1 


973.2 


659.3 


956.3 


955.0 


S32.0 


892.0 


807.2 


1036.2 


745.3 


1064.4 


1051.0 


890.0 


948.1 


859.3 


1065.3 


775.3 


1112.3 


1079.0 


943.8 


973.1 


915.2 


1090.1 


797.2 


1179.3 


1169.9 


1020.1 


989.0 


955.2 


1142.2 


897.2 


1201.2 


1187.8 


1180.0 


1036.1 


1036.2 


1158:2 


956,3 


1235.2 


1244.0 


1421.6 


1061.0 


1064.2 


1179.2 


1064.3 


1296.2 


1262.9 


1487.8 


1140.9 


1079.2 


1202.3 


1112.2 


1322.2 


1293.9 


1504.9 


1158.0 


1107.1 


1235.1 


1154.1 


1361,3 


1377.9 


1527.8 


1180.1 


1149.1 


1277.3 


1179.2 


1394.1 


1391.0 


1797.8 


1208.0 


1 179.2 


1296.1 


1201.2 


1407.1 


1533.8 


2161.6 


1278.0 


1188.1 


1300.1 


1235.1 


1493.2 


1617.8 


2178.6 


1299.9 


1235.1 


1307.2 


1243.2 


1512.2 


1741.7 


2216.5 


1307.0 


1277.2 


1361.2 


1296.1 


1535.2 


1769.8 


2233.4 


1383.9 


1308.1 


1407.1 


1361.2 


1553.1 . 


1792.7 


2272.6 


1486.9 


1391.3 


1475.2 


1397.0 


1585.1 


1938.3 


2726.2 


1583.8 ' 


1406.1 


1488.2 


1407.1 


1618.3 


2025.57 




1638.9 


1475.1 


1552.1 


1497.0 


1729.1 


2151.5 




1698.9 


1487.1 


1585.1 


1512.1 


1774.2 


2457.4 




1715.8 


1534.1 


1619.2 


1535.0 


1792.0 
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1738.8 


1618.1 


1638.2 


1551.2 


1970.2 






1791.6 


1715.9 


1699.1 


1585.1 


2180.1 






1844.7 


1770.1 


1717.1 


1618.2 


2383.8 






1993.7 


1791.0 


1729.0 


1729.1 


2596.8 






2162.5 


1914.0 


1791.0 


1756.0 








2179.6 


1993.6 


1838.1 


1775.0 








2221.8 


2025.9 


1994.1 










2238.4 


2151.8 


2181.0 










2383.0 




2383.9 










2704.7 




2412.0 










2717.2 




2703.8 










2854.0 




2706.9 










3041.7 











Table 4. Mass correlation of FLB-1 derived tryptic peptides, soybean beta-conglycinin, alpha 
chain precursor. 



Observed 


Matched 


Residues 


Peptide 


M/z 


MH + 




Sequence 


564.23 


564.28 


220-223 


(R)FNQR(S) 


564.23 


564.31 


393-397 


(K)SSSRK(T) 


621.3 


621.37 


356-360 


(K)VLFSR(E) 


859.3 


859.46 


279-286 


(R)LQSGDALR(V) 


955.27 


955.53 


224-231 


(R)SPQLQNLR(D) 


1079;2 


1079.54 


410-418 


(R)SRDPIYSNK(L) ; 


1188.12 


1188.52 


361-370 


(R)EEGQQQGEQR(L) 


1308.18 


1308.72 


419-429 


(R)LGKFFEITPEK(N) 


1391.32 


1391.62 


47-57 


(R)DSYRNQACHAR(C) 


1391.32 


1391.87 


306-318 


(R)LITLAIPVNKPGR(F) 


1406.18 


1406.72 


398-409 


(K)TISSEDKPFNLR(S) 


1534.18 


1534.82 


397-409 


(R)KTISSEDKPFNLR(S) 
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1618.17 


1618.85 


422-434 


(K)FFEITPEKNPQLR(D) 


1770.13 


1770.99 


371-385 


(R)LQESVTVEISKEQIR(A) 


1791.09 


1790.87 


356-370 


(K)VLFSREEGQQQGEQR(L) 


1914.0 


1913.94 


340-355 


(R)N1LEASYDTKFEEINK(V) 


2025.9 


2025.94 


479-494 


(K)EQQQEQQQEEQPLEVR(K) 


2151.88 


2152;03 


287-305 


(R)WSGTTYYVVNPDNNENLR(L) 



A peptide mass fingerprint for FLB^4 obtained 16 of 24 tryptic peptide matches to 
a protein known as soybean beta-conglycinin, beta chain precursor (Table 5, NCBInr5.13.99 
database). This , suggests that both the alpha and beta chain precursor proteins have been 
identified for soybean beta-conglycinin. 



5 Table 5. Mass correlation of FLB-4 derived tryptic peptides, soybean beta-conglycinin, beta 
chain precursor. 



\JUoyi VCU 


lvlaLCncU 


ivcsiQues 


Peptide 


M/z 


1YJLXJ. 




oequence 


545.0 


545 3 


21 6-91 Q 




545.0 


545.3 


383-386 


(R)QTER(Q) 


564.0 


564.3 


59-62 


(R)FNKR(S) 


564.0 


564.3 


227-231 


(K)SSSRK(T) 


659.0 


659.3 


220-224 


(R)QLSRR(A) 


956.0 


956.5 


63-70 


(R)SPQLENLR(D) 


1064.4 


1064.6 


148-157 


(K)LAIPVNKPGR(Y) 


1112.0 


1112.6 


62-70 


(K)RSPQLENLR(D) 


1235.3 


1235.5 


195-204 


(R)VLFGEEEEQR(Q) 


1361.0 


1361.7 


371-382 


(R)NFLAGEKDNWR(Q) 


1407.0 


1407.6 


232-243 


(K)TIS SEDEPFNLR(S) | 


1512.0 


1512.7 


318-329 


(K)QKQEEEPLEVQR(Y) 


1535.0 


1535.7 


231-243 


(R)KHSSEDEPFNLRS(S) 


1585.0 


1585.7 


26-37 


(K)VREDENNPFYFR(S) 


1618.0 


1618.8 


256-268 


(K)FFEITPEKNPQLR(D) 


1774.2 


1773.8 


387-402 


(R)QVQELAFPGSAQDVER(L) 
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2180.0 


2181.1 


126-144 


(R)IPAGTTYYLVNPHDHQNLK(I) 


2596.85 


2597.2 


103-125 


(R)AILTLVNNDDRDSYNLHPGDAQR(I 
) 



The above peptide mass fingerprints (FLB-1 to FLB-7) may be used to identify 
each protein marker, which when resolved on a 2-D gel illustrates a unique protein pattern. 
Thus, the protein pattern may be used to detect the presence or absence of EPSPS (EP-i), 
according to the methods described within the invention. It is appreciated that the skilled 
5 artisan, in view of the foregoing disclosure, would be able to produce nucleic acid sequences 
which encode the fragments described above, as well as nucleic acid sequences 
complementary thereto. In addition, the skilled artisan using conventional recombinant DNA 
methodologies, for example, by screening a cDNA library with such a nucleic acid sequence, 
would be able to isolate full length nucleic acid sequences encoding marker proteins. 
10 Example 4 Identification of 5-enolpymvylshikimate-3-phosphate synthase in transgenic 
soybean leaves 

Isolation of 5-enolpyruvylshikimate-3 -phosphate synthase from transgenic soybean leaves. 

A comparison of Coomassie stained SDS-PAGE gel patterns of proteins isolated from 
transgenic and non-transformed soybean leaf tissue resulted in the identifiction of EPSPS 

15 protein. Fresh leaf tissue was harvested from soybean plants 14-17 days after germination in 
a controlled environmental chamber and stored at -80 °C until further processing. Three 
leaves were sliced into small (2 mm) pieces and homogenized with a mortar and pestle on dry 
ice and treated with a buffered solution containing 100 mM Tris HC1, pH 7.8, 5 mM 
magnesium chloride, 100 mM sodium borate, 0.2% sodium ascorbate, and 0.05% v/v Tween 

20 80. The homogenate was transferred to a 1.5 mL microfiige tube, sonicated for 5 min., and 
transferred to ice for 10 min. This procedure was. repeated twice. Insolubb cellular debris 
was removed by centrifuging the homogenate for 10 min. at 14,000 g. The supernatant was 
transferred to a clean 1.5 mL microfuge tube. The Bio-Rad Protein Assay kit using a bovine 
gamma globulin standard was used to determine the protein concentration of the supernatant. 

25 The resulting soluble proteins were subjected to high-resolution SDS-PAGE analysis. 

Gel electrophoresis was performed according to standard procedures. Criterion™ precast 1 
mm thick4-20% gradient Tris-HCl gels (Bio-RAD, Hercules, CA) were used throughout, 
although gels with other acrylamide concentrations, thicknesses and numbers of wells can be 
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used, depending on the desired molecular mass range, and sample volumes. Electrophoresis 
was carried out on a Criterion mini electrophoresis cell (Bio-RAD) at room temperature. The 
electrophoresis running buffer was prepared from a 10X Tris-glycine-SDS solution (Bio- 
RAD). Following assembly of the gel system and addition of the running buffer, the soluble 
5 proteins were denatured in a Laemmli Sample Buffer (Bio-Rad) by boiling for 2 min. The 
protein/sample buffer solutions were vortexed and centrifuged prior to loading onto the gel. 
Electrophoresis was carried out at an initial current of 35 mA using a EC 105 power supply 
and was terminated when the dye front reached the bottom of the gel. Typical run times were 
1.5 h. Molecular weight standards consisted of myosin (M.sub.r 205,000), 0-galactosidase 
10 . (M.sub.r 120,000), bovine sierum albumin (M.sub.r 84,000), ovalbumin (M.sub.r 52,200), 
carbonic anhydrase (M.sub.r 36,300), soybean trypsin inhibitor (M.sub.r 30,200), lysozyme 
(M.sub.r 21,900), and aprotinin (M.sub.r 7,400)(Bio-Rad). Following electrophoresis, gels 
were stained in 0.2% Coomassie Brillant Blue G-250 and destained 
5-enolpymvylshilrimate-3-phosphate synthase (EPSPS) was identified by SDS-PAGE. 
15 As described in the previous above, SDS-PAGE was obtained by separating and 

resolving proteins isolated from either non-transformed or transgenic soybean leaves 
expressing EPSPS. Fig. 2 shows a SDS-PAGE gel displaying proteins from a typical protein 
pattern by transgenic soybean leaves expressing EPSPS. 

Comparison of protein patterns (Fig. 4) reveals that, while most proteins in the non- 
20 transformed and transformed soybean leaves are identical, one protein having a molecular 
weight of approximately 48 kDa is unique to the transgenic soybean leaf samples (labeled in 
Fig. 4). The protein labeled EPleaf-1 was reproducibly detected in all samples analyzed 

which had been previously identified as being transgenic for EPSPS. 

-. 

Characterization and identification of EPleaf- 1 

25 One protein band corresponding to EPleaf-1 was excised, proteolytically cleaved, and 

analyzed by mass spectrometry as described herein. Using the methodology described above, 
EPleaf-1 (MW 48 kDa) was isolated from 2-3 lanes on a SDS-PAGE gel of transgenic 
soybean seeds encoding the EPSPS gene, digested with trypsin, and the peptides were 
extracted. The resulting peptide mixture was analyzed directly by MALDI-MS. Correlation 

30 of the determined masses with the EPSPS sequence is shown in Table 6. Sixteen diagnostic 
tryptic peptide masses were identified, which covers 53% of the EPSPS sequence. 
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Table 6. Peptide masses obtained by MALDI-MS of the 48 kDa EPleaf-1 protein band 
digested with trypsin. 



Observed 


Predicted 


Sequence 


Residues 


M+H 


M 






616.2 


615.8 


RPMGR 


128-132 


711.2 


710.9 


VLNPLR 


133-138 


832.3 


832!0 


MSHGASSR 


i-8 


835.2 


835.0 


AMQAMGAR 


62^69 


862.1 


862.9 


GLSGRVR 


15-23 


872.1 


872.0 


SAVANGLK 


358-366 


872.0 


871.9 


GVTVPEDR 


313-320 


948.1 


948.1 


TPTPITYR 


161-168 


1115.0 


1115.2 


LAGGEDVADLR 


295-305 


1358.9' 


1359.5 


SFMFGGLASGETR 


34-46 


1464.9 


1465.2 


EMGVQVKSEDGDR 


139-151 


1645.9 


1646.8 


GLGNASGAAVATHLDHR 


389-405 


1993.7 


1994.2 


MLOGFGANLTVETDADGVR 


206-224 


2182.6 


2183.5 


TGLILTLOEMGADIEVINPR 


275-294 


2366.6 


2367.8 


SAVLLAGLNTPGITTVTEPIMTR 


178-200 


3250.2 


3250.7 


APSMIDEYPILAVAAAPAEGATVMNGLEE 
LR 


321-351 


4190.2 


4189.9 


LTGQVIDVPGDPSSTAFPLVAALLVPGSD 
VTILNVLMNPTR 


234-274 



Underlining reflects sequences confirmed by MALDI-PSD. 

Sequence tags were obtained from EPleaf-1 to definitively identify the protein by 
5 mass spectrometry. Post source decay (PSD) fragment ion spectra were acquired for five 
peptides (832.3, 1358.9, 1646.8, 1993.7, and 2182.6 m/z) after isolation of the appropriate 
precursor ion by using timed ion selection. Fragment ions were refocused as described 
herein. The individual segments were stitched together by using software provided by 
PerSeptive Biosystems. The MALDI-PSD analysis verified the amino acid sequence for each 
10 of the five-peptide masses. 
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Example 5. Detection and quantification of 5-enolpymvylshikimate-3-phosphate synthase in 
soybean tissue 

Method to enrich for EPSPS in plant tissue. 

Once EPSPS was identified in both seed and leaf tissue, a method to enrich for EPSPS 
5 was employed to assist rapid detection and quantification of the target protein from plant 
tissue. All protein enrichment procedures were carried out on ice. EPSPS isolation from leaf - 
and seed were done as previously described herein, except the extraction buffer consisted of 
100 mM Tris HC1, pH. 7.8, 5 mM magnesium chloride, 100 mM sodium borate, and 0.2% 
sodium ascorbate. The slurry was transferred to a 2 mL Kontes glass issue grinder for further 
10 homogenization: The homogenate was transferred to a 1.5 mL microfuge tube, sonicated for 
5 min., and chilled on ice for 10 minutes. This procedure was repeated twice. Insoluble 
cellular debris -was removed by centrifuging the homogenate for 10 min. at 14,000 g. The 
supernatant was decanted into a clean 1.5 mL microfuge tube. The Bio-Rad Protein Assay kit . 
using a bovine gamma globulin standard was used to determine the protein concentration of 
15 the supernatant. 

The resulting soluble proteins were loaded onto 200 jxL of equilibrated Q-Sepharose 
resin in a 1.5 mL microfuge tube and gently inverted several times for 10 minutes. Q- 
Sepharose resin and protein complexes were centrifuged at 3,000 g for 10 seconds. Unbound 
soluble proteins were removed by washing with 3 volumes of buffer. In 100 mM increments, 
20 starting with 100 mM NaCl to 1,000 mM NaCl the resins were washed with each salt 
concentration to fractionate proteins. Each fraction was collected and analyzed by SDS- 
PAGE, followed by excising, proteolytic cleaving, and analyzing by MALDI-TOF MS. 
Peptide mass fingerprints were used to verify the presence or absence of EPSPS in the various 
tissues. 

25 EPSPS eluted from fractions washed with 300-400 mM NaCl. 

The resulting enrichment of EPSPS was found to be nearly 50% of the protein from 
leaf and 10% of the protein from seed. 

Use of modified peptides for detection and quantification of EPSPS. 

It is possible to use MALDI-TOF MS to analyze any compound in a sample after the 
30 analytes of interest have been extracted from a sample matrix. Key issues include the 
selection of matrixes, preparation of matrix and sample, signako-noise ratio, spot-to-spot 
reproducibility, and response factors. Because of spotto-spot variation, an internal standard . 
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is a requirement for quantification of analytes using MALDI-TOF MS. In this example, two 
EPSPS tryptic peptides were selected for internal standards. Several criteria were used to 
select the two peptides. First, fee two peptides needed to be located near the C-terminal and 
N-Terminal of EPSPS, this would allow for the monitoring of protein processing. The second 
5 requirement is that both peptides need to be detected under lower laser strength with good 
spot-to-spot reproducibility and high sensitivity. Third, each peptide needed to be modified 
such that the peptide mass was not overlapping the native peptide mass (precursor peptide 
mass) and/or other signature or non-signature diagnostic peptides. The fourth requirement 
was to have peptides, which could easily be synthesized. Two peptides were selected from 
10 the peptide mass fingerprints from EPSPS isolated from seed and leaf as described herein, 
which met the above criteria (Table 7). 



Table 7. Selected peptides for monitoring and quantifying EPSPS. 



Sequence 


Residues 


Mass 


Modificatio 
n 

(acetylatio 
n) 


Modified 

peptide 

mass 


MP*1 SFMFGGLASGETR 


34-46 


1359.5 


42 


1401.5 


MP-2 

GLGNASGAAVATHLDHR 


389-405 


1646.8 


42 


1688.8 



Both peptides were synthesized by solid phase and the N-terminal of each peptide was 
modified by acetylation (Genemed Synthesis Inc, South San Francisco, CA). This 

15 modification increases the mass of each peptide by 42 Da (Table 7). 
Calibration curve construction 

To establish the detection sensitivity, calibration curves for each peptide were 
constructed using known amounts of the two synthetic peptides. Various concentrations of 
peptide solutions were prepared and subjected to MALDI-TOF MS analysis. Synthetic 

20 peptides were analyzed at amounts ranging from 15.6 finol, 31.2 finol, 62.5 finol, 125 finol, 
250 finol, and 500 frnol for both peptides. Calibration curves were also validated with spiked 
modified peptides in plant extracts or enriched target proteins. Various concentrations of 
peptide solutions were prepared and analyzed blind by MALDI-TOF MS. All calibration 
points were examined in triplicate and a linear correlation was observed for both peptides 
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over this concentration range (Figs. 5A and 5B). These blind tests demonstrated the inter-run 
reproducibility of the calibration curves. The results demonstrate levels of detection below 
lOfemtomoles. 

Rapid screening of EPSPS in soybean leaf tissue 
5 The techniques of the present invention were used to monitor for the expression of 

EPSPS in soybean leaf tissue. Ten different soybean plants were grown; one plant that did 
not express EPSPS was used as a control. Plant tissue was homogenized as described herein 
and 50 fig of total protein were loaded onto SDS-PAGE. A corresponding protein at a 
molecular weight of 48 kDa was excised, proteolytically cleaved with trypsin, and analyzed 

10 by MALDIrTOF MS. In Table 8, diagnostic peptide masses were identified in soybean leaf 
tissue from nine distinct samples. In the control soybean leaf material no peptides 
corresponding' to EPSPS were identified. In each of the transformed lines with EPSPS 
signature diagnostic peptide masses were detected. It should be noted that this example 
provided the ability to detect signature diagnostic peptide masses as a means to monitor for 

15 EPSPS. 



Table 8. Screening for diagnostic peptide mass fingerprints of EPSPS in soybean leaf tissue. 





Diagnostic peptide masses m/z 


Sample 


616 


711 


862 


1115 


1358 


1645 


1993 


2182 


2366 


3256 


Control 






















SLP-1 


+ 


+ 


+ 


+ 


+ 


+ 




+ 


+ 




SLP-2 


+ 






+ 


+ 




+ 


+ 


+ 




SLP-3 


+ 


+ 


+ 


+ 




+ 


+ 


+ 




+ 


SLP-4 


+ 


+ 


+ 


+ 








+ 


+ 




SLP-5 


+ 




+ 


+ 


+ 


+ 


+ 


+ 


+ 




SLP-6 


+ 




+ 


+ 


+ 


+ 


+ 


+ 




+ 


SLP-7 


+ 


+ 




+ 


+ 


+ 




+ 




+ 


SLP-8 




+ 




+ 




+ 


+ 


+ 


+ 




SLP-9 • 


+ 


+ 


+ 


+ 


+ 




+ 


+ 


+ 


+ 



+ = presence of peptide mass, - = absence of peptide mass 
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Quantitation of EPSPS from a SDS-PAGE 

On the basis of the high sensitivity achieved for MALDI-TOF MS in the detection of 
EPSPS a protein at 48 kDa was excised from SDS-PAGE, and the method was used as an 
alternative technique to quantify EPSPS in soybean tissues of both seeds and leaves. 
5 Modified-peptides were added to "in-gel" tryptic peptides used as internal calibrants for 
monitoring, detecting, and quantifying EPSPS (Figs. 6 A and 6B). Tables 9 and 10 list the 
EPSPS content of 6 soybean leaf samples and Table 11 lists the EPSPS content of 10 soybean 
seed samples as determined by MALDI-TOF MS analysis. Five soybean leaf samples and 
nine soybean seed samples had EPSPS peptide masses which allowed quantification within 
10 the linear range. Two of the-samples did not result in the detection of EPSPS peptide masses. 
Table 9. MALDI-TOF MS quantification of EPSPS in soybean leaves from separation from 
SDS-PAGE using MP-1 spiked calibrant 



Sample 


1359 
(mean) 


Spike 1401 
(mean) 


Ratio 


MP-1 
(finol) 


EPSPS 
(finol) 


Leaf 1 


51652 


18964 


2.7 


250 


.681 


Leaf 2 


70638 


13848 


5.1 


250 


1275 


Leaf 3 


55013 


13371 


4.1 


250 


1028 


Leaf4 


32147 


11340 


2.8 


250 


708 


Leaf 5 


60520 


11847 


5.1 


250 


1277 



Table 10. MALDI-TOF MS quantification of EPSPS in soybean leaves from separation from 
SDS-PAGE using MP-2 spiked calibrant. 



Sample 


1646 


Spike 1688 


Ratio 


MP-2 


EPSPS 




(mean) 


(mean) 




(finol) 


(finol) 


Leaf 1 


41369 


9723 


4.2 


250 


1050 


Leaf 2 


28429 


6800 


4.2 


250 


1050 


Leaf 3 


30466 


7377 


4.1 


250 


1025 


Leaf4 


21970 


4788 


4.6 


250 


1150 


Leaf 5 


29338 


5338 


5.5 


250 


1375 



15 Table 11. MALDI-TOF MS quantification of EPSPS in soybean seeds from separation from 
SDS-PAGE using MP-1 spiked calibrant. 
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Sample 


1359 


Spike 1401 


Ratio 


MP-1 


EPSPS 




(mean) 


(mean) 




(finol) 


(finol) 


Seedl 


33828 


17534 


1.9 


250 


475 


Seed 2 


22158 


23664 


0.94 . 


250 


235 


Seed 3 


21743 


21214 


1.02 


250 


255 


Seed 4 


13657 


20394 


0.67 


250 


167.5 


Seed 5 


16257 


19216 


0.85 


250 


212.5 


Seed 6 


17439 


22830 


0.76 


250 


190 


Seed 7 


15441 


21434 


0.72 


250 


180 


Seed 8 


11496 


19764 


0.58 


250 


145 


Seed 9 


12226 


18069 


0.68 


250 


170 



Quantitation of EPSPS in soybean leaves 

All soybean leaves samples were analyzed using the resinrdeveloped method. MP-1 
and MP-2 were used as an internal calibrants and added to the enriched EPSPS tryptic digest. 
The results are presented in Tables 12 and 13. These data indicate that MALDI-TOF MS 
5 coupled with a resin-based method accurately determined EPSPS levels in soybean leaves. 



Table 12. MALDI-TOF MS quantification of EPSPS in soybean leaves after enriching by Q- 
Sepharose resin using MP-1 spiked calibrant. 



Sample 


1359 


Spike 1401 


Ratio 


MP-1 


EPSPS 




(mean) 


(mean) 




(finol) 


(finol) 


Leaf 1 


38248 


14469 


2.64 


250 


661 


Leaf2 


20571 


8757 


2.35 


250 


587 


Leaf3 


46781 


14824 


3.16 


250 


789 


Leaf4 


32040 


9560 


3.35 


250 


838 


Leaf 5 


24204 


12500 


1.94 


250 


484 


Table 13. MALDI-TOF MS quantification of EPSPS in soybean leaves after enriching by Q- 


Sepharose resin using MP-2 spiked calibrant. 








Sample 


1646 


Spike 1688 


Ratio 


MP-2 


EPSPS 




(mean) 


(mean) 




(finol) 


(finol) 
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Leaf 1 


24667 


7157 


3.45 " 


250 


861 


Leaf 2 


17196 


5602 


3.07 


250 


767 


Leaf 3 


15202 


4638 


3.28 


250 


819 


Leaf4 


20469 


6598 


3.10 


250 


775 


Leaf 5 


22369 


7953 


2.81 


250 


703 



Example 6. Detection and quantification of 5-enolpyruvylshikimate-3-phosphate synthase in 
maize tissue 

As demonstrated in previous examples, EPSPS can be quantified in different dicot 
tissues such as seed and leaves. However the method is readily applicable to other plant 
5 species. Therefore, samples of leaf samples from maize (a monocot plant) were analyzed 
using the resin-developed method developed for soybean leaves. MP-1 was used as an 
internal calibrant and added to the enriched EPSPS tryptic digest The results are presented in 
Table 14. These data indicate that MALDI-TOF MS coupled with a resin-based method can 
accurately determine EPSPS levels in maize leaves. The low limit of detection demonstrates 
10 the high sensitivity of MALDI-TOF MS for identification of EPSPS. 



Table 14. MALDI-TOF MS quantification of EPSPS in soybean leaves after enriching by Q- 
Sepharose resin using MP-1 spiked calibrant 



Sample 


1359 
(mean) 


Spike 1401 
(mean) 


Ratio 


MP-1 
(fmol) 


EPSPS 
(fmol) 


Leaf 1 


14848 


25302 


0.59 


250 


147 


Leaf 2 


11275 


19858 . 


0.57 


250 


142 


Leaf '3 


14236 


23186 


0.61 


250 


153 


Leaf 4 


18018 


29271 


0. 61 


250 


153 


Leaf 5 


10557 


18515 


0,57 


250 


142 



INDUSTRIAL APPLICABILITY 
15 The present invention has applicability in a wide variety of diagnostic applications 

including quality control. 
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CLAIMS: 

1 . A method of detecting a target protein in a biological sample, comprising: 

(a) obtaining a biological sample; 

(b) extracting proteins from the sample; 

5 (c) concentrating the target protein from.extracted proteins of (b); 

(d) digesting concentrated proteins of (c) to produce peptide fragments of the 
proteins, wherein the fragments include at least one signature peptide specific to 
the target protein; and 

(e) analyzying the peptide fragments via mass spectrometry, wherein detection of 
10 " " the at least one signature peptide is indicative of presence of target protein in the 

sample. 

2. The method of claim 1 wherein said analyzing further comprises (f) calculating 
amount of the protein in the sample by comparing mass spectrometry signals 
generated from the signature peptide and its corresponding internal standard 

15 peptide. 

3 . The method of claim 2 wherein said digesting produces peptide fragments 
containing at least two signature peptides and said analyzing comprises comparing 
mass spectrometry signals generated from at least one of the signature peptides 
and its corresponding internal standard peptide. 

20 4. The method of claim 1 wherein the sample is a plant or derived from a plant. 

5. The method of claim 4 wherein the sample is plant seed. 

6. The method of claim 1 wherein the sample is an animal or derived from an animal. 

7. The method of claim 1 wherein the sample is a microorganism or derived from a 
microorganism. 

25 8. The method of claim 1 wherein the sample is a pharmaceutical or a nutraceutical. 

9. The method of claim 1 wherein the sample is a food product or feed. 

10. The method of claim 1 wherein the sample is a cosmetic. 

1 1 . The method of claim 1 wherein the peptide fragments include at least two 
signature peptides specific to the target protein. 

30 12. The method of claim 1 wherein the protein is non-native to source of the sample. 

13. The method of claim 1 wherein said concentrating comprises contacting extracted 
proteins with a resin that preferentally adsorbs the target protein. 
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14. The method of claim 13 wherein said concentrating further comprises contacting 
the extracted proteins with a resin that preferentially absoibs non-target proteins in 
the sample. 

15. The method of claim 3 wherein said digesting is performed with a protease which 
is trypsin, endoprotease-Arg-C, endoprotease-Aspn-N. endoprotease-Glu-C or 
endoprotease -Lys-C. 

16. The method of claim 1 wherein and said analyzing comprises introducing the 
peptide fragments and a matrix into a matrix assisted laser desotption ionization 
(MALD1) time-of-flight (T OF) analyzer. 

1 7. The method of claim 1 6 wherein the matrix comprises alpha-cyano -4- 
hydroxycinniamic acid. 

18. The method of claim 1 wherein said analyzing comprises introducing the peptide 
fragments into an ion trap electrospray ionization apparatus (ESI). . . ^ 

19. The method of claim 1 further comprising (h) introducing the composition intoja 
liquid chromatograph (LC) prior to said analyzing. ^ 

20. : The method of claim 19 wherein the LC is a micro-LC. . , „ 

21 . The method of claim ] 9 wherein the LC is a nano-LC. 

22. A method of quantifying a target protein which is an expression product of a. 
transgene contained in a transgenic plant, tissue or cell, orfor quantifying jhe^ 
target protein in a product derived from the transgenic plant, comprising; -. ^ 

(a) obtaining a sample from the transgenic plant or the product: 

(b) extracting proteins from the sample; 

(c) concentrating the target protein from extracted proteins o£ (b); 

(d) digesting concentrated proteins of (c) with a protease to produce peptide 
fragments of -the proteins, wherein the fragments include at least one.signature peptide 
specific to the target protein; 

(e) analyzying via mass spectrometry the peptide fragments and for at least one 
signature peptide, a corresponding internal standard peptide; and 

(f) calculating amount of the target protein in the sample by .comparing mass 
spectrome try signals generated from the signature peptide with mass spectrometry signals 
generated by the corresponding internal standard peptide. 
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23 . A method of quantifying a target protein which is an expression product of a 

transgene contained in a transgenic animal, tissue or cell, or for quantifying the 
target protein in a product derived from the transgenic animal, comprising: 
(a) obtaining a sample from the transgenic animal or the product; 
5 (b) extracting proteins from the sample; 

(c) concentrating the target protein from extracted proteins of (b); 

(d) digesting concentrated proteins of (c) with a protease to produce peptide 
fragments of the proteins, wherein the fragments include at least one signature peptide 
specific to the target protein; 

10 (e) analyzying via mass spectrometry the peptide fragments and for at least one 

signature peptide, a corresponding internal standard peptide; and 

(f) calculating amount of the target protein in the sample by comparing mass 
spectrometry signals generated from the signature peptide with mass spectrometry signals 
generated by the corresponding internal standard peptide. 
15 24. A method of quantifying a target protein which is an expression product of a 

transgene contained in a transgenic microorganism, or for quantifying the target 
protein in a product derived from the transgenic microorganism, comprising: 

(a) obtaining a sample from the transgenic microorganism or the product; 

(b) extracting proteins from the sample; 

20 (c) concentrating the target protein from extracted proteins of (b); 

(d) digesting concentrated proteins of (c) with a protease to produce peptide 
fragments of the proteins, wherein the fragments include at least one signature peptide 
specific to the target protein; 

(e) analyzying via mass spectrometry the peptide fragments and for at least one 
25 signature peptide, a corresponding internal standard peptide, an internal standard; and 

(f) calculating amount of the target protein in the sample by comparing mass 
spectrometry signals generated from the signature peptide with mass spectrometry signals 
generated by the corresponding internal standard peptide. 

25. A method of quantifying a target protein which is an expression product of a 
30 transgene contained in a transgenic plant, tissue or cell, or for quantifying the 

target protein in a product derived from the transgenic plant, comprising: 
(a) obtaining a sample from the transgenic plant or the product; 
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(b) extracting proteins from the sample; 

(c) concentrating the target protein from extracted proteins of (b); 

(d) digesting concentrated proteins of (c) with a protease to produce peptide 
fragments of the proteins, wherein the fragments include at least two signature peptide 
specific to the target protein; 

(e) analyzying via mass spectrometry the peptide fragments and for at least one of 
the signature peptides, a corresponding internal standard peptide; and 

(f) calculating amount of the target protein in the sample by comparing mass 
spectrometry signals generated from at least one of the signature peptides with mass 
spectrometry signals generated by the corresponding internal standard peptide. 

26. A method of quantifying a target protein which is an expression product of a 
transgene contained in a transgenic animal, tissue or cell, or for quantifying the 
target protein in a product derived from the transgenic animal, comprising: 

(a) obtaining a sample from the transgenic animal or the product; 

(b) extracting proteins from the sample; 

(c) concentrating the target protein from extracted proteins of (b); 

(d) digesting concentrated proteins of (c) with a protease to produce peptide 
fragments of the proteins, wherein the fragments include at least two signature peptide 
specific to the target protein; 

(e) analyzying via mass spectrometry the peptide fragments and for at least one of 
the signature peptides, a corresponding internal standard peptide; and 

(f) calculating amount of the target protein in the sample by comparing mass 
spectrometry signals generated from at least one of the signature peptides with mass 
spectrometry signals generated by the corresponding internal standard peptide. 

27. A method of quantifying a target protein which is an expression product of a 
transgene contained in a transgenic microorganism, or for quantifying the target 
protein in a product derived from the transgenic microorganism, comprising: 

(a) obtaining a sample from the transgenic microorganism or the product; 

(b) extracting proteins from the sample; 

(c) concentrating the target protein from extracted proteins of (b); 
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(d) digesting concentrated proteins of (c) with a protease to produce peptide 
fragments of the enriched proteins, wherein the fragments include at least two signature 
peptides specific to the target protein; 

(e) analyzying via mass; spectrometry the peptide fragments and for at least one 
5 the signature peptides, a corresponding internal standard peptide; and 

(f) calculating amount of the target protein in the sample by comparing mass 
spectrometry signals generated from at least one of the signature peptides with mass 
spectrometry signals generated by the corresponding internal standard peptide. 
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Fxcpire 1 

Predicted Differential Protease Cleavage Peptides of EPSPS 



Peptide Trypsin 


Lys-c 


Arg-C 


Asp-N 


Glu-C 


Number* 


(KR) 


. (K) 


(R) 


(D) ( 


E) 


1 


1328.5 


1456.6 


1328.5 


2568.9 


4477.0 


2 


... 146.2 


1373 .5 


991.1 


2996.3 


902.1 


• 3 


862.9 


3480.9 


1109.2 


2146 . 5 


204 .2 


4 


528.6, 


1232. 5. 


1359.5 


646 .7 


2 089.5 


5 


;;.5 98.7 


5620.3 


2375.7 


1379.5 


1784 .0 


6 


1359.5 


2080.5 


287.4 


1945.2 


5144.9 


7 


1558.7 


1639.8 


3315.7 


280.3 


877.0 


8 


835.0 


1860.2 


3048.5 


638 . 7 


6098.1 


9 


287.4 


2978.5 


710.9 


2912 .4 


1407.6 


10 


14 6.2 


3005.4 


1449. 6 


190.2 


1345.5 


11 


3187.5 


8224 .5 


697-9 


5443 .5 


5557.5 


12 


2450.8 


4332 • 0 


1230.4 


1991.2 


634.7 


13 


615.8 


1359.5 


3279.9 


2 04 .2 


1025 .4 


14 


710.9 


2316.5 


2604 .8 


2068 .4 


2042 .3 


15 


789.9 


5950 . 9 


388.5 


386.4 


1033 .1 


16 


677.6 


804 . 9 


473 .5 


1742 . 0 


1235 .4 


17 


687 .9 


160.2 


4375.1 


2714 .2 


891.0 


18 


300.4 




2183 .5 


13 82.5 


147 .1 


19 


948 . 1 




1115 .2 


303.3 


643 .8 


20 


93 0 . 1 




273 .3 


1757. 0 


2076 .3 


21 


2367.8 




1388.5 


788. 9 


2 04 .2 


22 


628.6 




3250 .7 


3194 . 6 


4292 . 0 


23 


1994 .2 




732 . 8 


1526.8 


1808 . 0 


24 


388 .5 




2559 .9 


236.2 


1395 .7 


25 


473 .5 




2257.4 


1414 . 5 


704 .8 


26 


203 .2 




5251.1 


1538 . 7 




27 


4189 . 9 






2315 . 7 




28 


2183.5 






133. 1 




29 


1115.2 






1460.7 




30 


273.3 






1317 . 6 




31 


534 . 6 






504 .5 




32 


871.9 










33 


3250.7 










34 


245.3 










35 


505.5 










36 


872 . 0 










37 


1705. 9 










38 


628 . 7 










39 


1646. 8 








(amino acid 


cleaved) 












40 


4322 . 1 








single amino 
acid 


41 


804 . 9 






preferred size 
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160.2 



peptides 
* from N- terminus 
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Figure 2 
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Figure 3A 
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Figure 3B 
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Figure 5A 
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Figure 5B 
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Figure 6B 



100j 
90 
80 
70 

^ 60 
J 50 
5 s 40 
30 
20 
10 



13593 



13603 



,357.3 
358.3 

,1362.2 



"1646.3 
lt647.3 



1648.3 



1688.3 



1330 



1649.3 J 1689 - 3 



1415 



1500 



1585 



1670 



1755 



Mass (m/z) 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO_ 



_02084250A2_L> 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
Internationa] Bureau 

(43) International Publication Date 
24 October 2002 (24.10.2002) 




PCT 



(10) International Publication Number 

WO 02/084250 A3 



(51) International Patent Classification 7 : G01N 1/00 (81) Designated States (national): AE, AG, AL, AM, AT, AU, 



(21) International Application Number: PCT/US02/12014 

(22) International Filing Date: 17 April 2002 (17.04.2002) 
(25) Filing Language: English 



(26) Publication Language: 

(30) Priority Data: 

•60/284,273 
60/284,713 



English 



17 April 2001 (17.04.2001) US 

18 April 2001 (18.04.2001) US 



(71) Applicant (for all designated States except US): FEM- 
TOLINK BIOTECHNOLOGIES LLC [US/US]; 1 1930 
Heritage Oak Place, Suite 5, Auburn, CA 95603 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): EVERETT, 
Nicholas, P. [GB/US]; 604 Apple Tree Lane, Meadow 
Vista, CA 95722 (US). PETELL, James, IC [US/US]; 
9203 W. Gage Boulevard, Kennewick, WA 89336 (US). 
YOUNG, Scott, A. [US/US]; 4400 Partridge Lane, Mid- 
land, MI 48640 (US). 

(74) Agent: FOLEY, Shawn, P.; Lerner, David, Littenberg, 
Krumholz & Mentlik, LLP, 600 South Avenue West, West- 
field, NJ 07090 (US). 



AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SD, SE, SG, 
SI, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, 
VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAPI patent 
(BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR, 
NE, SN, TD, TO). 

Published: 

— with international search report 

(88) Date of publication of the international search report: 

10 April 2003 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 

o 

ID 
<S 

00 . 

® (54) Title: METHODS FOR MASS SPECTROMETRY DETECTION AND QUANTIFICATION OF SPECIFIC TARGET PRO- 
TEINS IN COMPLEX BIOLOGICAL SAMPLES 



(57) Abstract: Mass spectrometry -based methods are described for the detection or quantification of targeted proteins in biological 
samples e.g., plants, animals, and microorganisms, parts (e.g., tissue or cells) thereof, and products derived from plants, animals or 
microorganisms. 



BNSDOCID: <WO 0208425QA3J_> 



•9 



INTERNATIONAL SEARCH REPORT 



International application Mo. 
PCT/US02/I2014 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(7) : G01N 1/00 

US CL : 436/173 
According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 436/173 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
Please See Continuation Sheet 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 3 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



X 
Y 
X 
Y 
Y 



US 5,346,991 A (ROY et al) 13 September 1994 (13.09. i994); see entire document, 
especially column 7, lines 17-43. 



US 5,977,442 A (KLESSIG et al) 2 November 1999 (02.11. 1999); see entire document, 
especially column 23, line 20 to column 25, line 10. 



FAVRETTO et al. Silicon Membrane Interface for the Direct Anaylsis of Kathon CG in 
Aqueous Solutions and Cosmetic Emulsions. Biological Mass Spectrometry, 1991, Vol. 
20, pages 669-676; see entire document, especially the Abstract and page 676. 

NILSSON et al. New Separation Tools for Comprehensive Studies of Protein Expression 
by Mass Spectrometry. Mass Spectrometry Reviews, 2000; Vol. 19, pages 390-397; see 
enure document, especially the Abstract, Section II from pages 391-393 and Section C on 
page 395. 

GYGI et al. Measuring gene expression by quantitative proteome analysis. Current 
Opinion in Biotechnology, 2000, Vol. 1 1 , pages 396-401; see entire document. 



1-3,6-8,11-13,15,16 
and 18-27 

4,5,9,10,14 and 17 
1-3,6-8,11-13,15,16 
and 18-27 



4,5,9,10,14 and 17 
10 



14 and 19-21 



1-27 



X Further documents are listed in the continuation of Box C. | j See patent family 



annex. 



r Special categories of cited documents: 

'A" document defining the general state of the art which is not considered to be 
of particular relevance 

E w earlier application or patent published on or after the international filing date 

L" document which may throw doubts on priority claim(s) or which is cited to 
establish the publication date of another citation or other special reason (as 
specified) 

"O" document referring to an oral disclosure, use, exhibition or other means 

P" document published prior to the international filing date but later than the 
priority date claimed 



T" later document published after the international filing date or priority 

date and not in conflict with the application but cited to understand the 
principle or theory underlying the invention 

"X w document of particular relevance; the claimed invention cannot be 

considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

"Y" document of particular relevance; the claimed invention cannot be 

considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 

document member of the same patent family 



Date of the actual completion of the international search 
26 December 2002 (26. 12.2002) 



Name and mailing address of the ISA/US 

Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 

Facsimile No. (703)305-3230 



Form PCT/ISA/210 (second sheet) (July 1998) 



Date of raaitoQjf 



rearch report 



rid officer 
Telephone No. (703) 308-0196 



la A. Lambertson (J 



INTERNATIONAL SEARCH REPORT 



PCT/US02/I2014 



C. (Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



• ALOMIRAH et al. Applications of mass spectrometry to food proteins and peptides. 
Journal of Chromatography A, 2000, Vol. 893 , pages 1-21; see entire document, 
especially pages 1-2, the Abstract and the Introduction. 

WILSON: Multiple hyphenation of liquid chromatography with nuclear magnetic 
resonance spectroscopy, mass spectrometry and beyond. Journal of Chromatography A, 
2001, Vol. 892, pages 315-327; see entire document, especially pages 315-316 (the 
Abstract and Introduction), 318-318 (section on Drug Metabolites) and page 321 (the 
section on Natural Products). 



8 and 9 



4,5 and S 



Form PCT/ISA/210 (second sheet) (July 1998) 



BNSOOCID: <WO 02084250A3J_> 



INTERNATIONAL SEARCH REPORT 



PCT/US02/120I4 



Continuation of B, FIELDS SEARCHED Item 3: 
WEST, MEDLINE, DIALOG 

Search Terms: mass spectrometry/spectroscopy, MALDI, MS, protein/polypeptide, idenufy/idenuficaUon, 
extract/extraction/extracting, digest/digesting/digesdon/proteolysis, concentrate/concentrating 



FormPCT/ISA/210 (second sheet) (July 1998) 



BNSDCCID: <WO 02084 2S0A3J_> 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
FADED TEXT OR DRAWING 
BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 




